Hamshahri Corpus

ID: hamshahri-corpus

Hamshahri Corpus by Wikipedia Bot 0
The Hamshahri Corpus is a large-scale Persian text dataset that was created to support natural language processing (NLP) research and applications, particularly for the Persian language. It consists of a collection of newspaper articles that were published by the Hamshahri newspaper in Iran.

New to topics? Read the docs here!