<学術雑誌論文>
CEFR-based Lexical Simplification Dataset

作成者
本文言語
出版者
発行日
収録物名
開始ページ
終了ページ
会議情報
出版タイプ
アクセス権
権利関係
関連DOI
関連DOI
関連URI
関連HDL
概要 This study creates a language dataset for lexical simplification based on Common European Framework of References for Languages (CEFR) levels (CEFR-LS). Lexical simplification has continued to be one ...of the important tasks for language learning and education.There are several language resources for lexical simplification that are available for generating rules and creating simplifiers using machine learning. However, these resources are not tailored to language education with word levels and lists of candidates tending to be subjective. Different from these, the present study constructs a CEFR-LS whose target and candidate words are assigned CEFR levels using CEFR-J wordlists and English Vocabulary Profile, and candidates are selected using an online thesaurus. Since CEFR is widely used around the world, using CEFR levels makes it possible to apply a simplification method based on our dataset to language education directly. CEFR-LS currently includes 406 targets and 4912 candidates. To evaluate the validity of CEFR-LS for machine learning, two basic models are employed for selecting candidates and the results are presented as a reference for future users of the dataset.続きを見る

本文ファイル

pdf LREC2018_238 pdf 195 KB 440  

詳細

レコードID
関連URI
関連ISBN
主題
登録日 2018.06.14
更新日 2018.06.14

この資料を見た人はこんな資料も見ています