Title
|
Construction of an Annotated Corpus for Kurdish Abstractive Text Summarization
|
Research type
|
Conference paper (presented)
|
Keywords
|
text summarization, corpus, NLP, Kurdish language, transformer.
|
Abstract
|
Automatic text summarization has become an essential task in natural language processing (NLP). However, developing summarization systems requires datasets for proper evaluation, and this need applies to less-resourced languages as well. This research produces and presents the first freely available annotated corpus for evaluating abstractive Kurdish text summarization systems. The corpus was collected from news articles. In addition, an abstractive Kurdish text summarization model based on transformers has been developed for the first time, to be evaluated on this dataset. The current work can serve as a baseline for future research.
|
Researchers
|
Abouzar Ghorbani (third author), Pedram Yamini (second author), Fatemeh Daneshfar (first author)
|