Bianet: A Parallel News Corpus in Turkish, Kurdish and English
Abstract
The Bianet corpus, consisting of news articles in Turkish, English, and Kurdish, is evaluated for its effectiveness in bilingual and multilingual neural machine translation models.
We present a new open-source parallel corpus of news articles collected from the Bianet magazine, an online newspaper that publishes news in Turkish, often with translations into English and Kurdish. In this paper, we describe how the corpus was collected and report its statistical properties. We validate the benefit of the Bianet corpus by evaluating bilingual and multilingual neural machine translation models in the English-Turkish and English-Kurdish directions.