Bijankhan Corpus
The Bijankhan corpus (Persian: پیکرهٔ بیجنخان) is a tagged corpus that is suitable for natural language processing (NLP) research on the Persian language. This collection is gathered from daily news and common texts. In this collection all documents are categorized into different subjects such as political, cultural, etc.; in about 4300 different subject categories. The corpus contains about 2.6 million manually tagged words with a tag set that contains 550 Persian part-of-speech tags.
Link from a Wikipage to another Wikipage
primaryTopic
Bijankhan Corpus
The Bijankhan corpus (Persian: پیکرهٔ بیجنخان) is a tagged corpus that is suitable for natural language processing (NLP) research on the Persian language. This collection is gathered from daily news and common texts. In this collection all documents are categorized into different subjects such as political, cultural, etc.; in about 4300 different subject categories. The corpus contains about 2.6 million manually tagged words with a tag set that contains 550 Persian part-of-speech tags.
has abstract
The Bijankhan corpus (Persian: ...... is contributions in this area.
@en
Link from a Wikipage to an external page
Wikipage page ID
14,570,613
page length (characters) of wiki page
Wikipage revision ID
1,021,770,018
Link from a Wikipage to another Wikipage
wikiPageUsesTemplate
hypernym
type
comment
The Bijankhan corpus (Persian: ...... 0 Persian part-of-speech tags.
@en
label
Bijankhan Corpus
@en