NL
<< Back

Native Language Identification

In this master thesis, I developed a Machine Learning algorithm that tried to predict the native language of the author of an English text. The algorithm was based on Support Vector Machines and used Liblinear as a utility.

Tech Stack

Java, Liblinear, LaTeX

Features

  • Loanwords
  • Lexemes
  • Lemmas
  • POS
  • Characters
  • Complex features

Afbeeldingen