Hamshahri Corpus

The Hamshahri Corpus (Persian: پیکره همشهری) is a sizable Persian-language text corpus based on the Iranian newspaper Hamshahri, one of the first online Persian-language newspapers in Iran. It was initially collected and compiled by Ehsan Darrudi at the DBRG group of the University of Tehran. Later, a team headed by Ale Ahmad built on this corpus to create the first Persian text collection suitable for information retrieval evaluation tasks.
