diff options
author | Laura Orvokki Kursula <lav@vampires.gay> | 2024-08-24 14:32:53 +0200 |
---|---|---|
committer | Laura Orvokki Kursula <lav@vampires.gay> | 2024-08-24 14:32:53 +0200 |
commit | 1370074bd345fa54297b79d726dc7fa37453ec3d (patch) | |
tree | 831922450f235a6cbf875d8c630acaaf86329b44 /README | |
parent | 86969e644c86401a8a0526f45da832e260149766 (diff) | |
download | aspell-nn-1370074bd345fa54297b79d726dc7fa37453ec3d.tar.gz aspell-nn-1370074bd345fa54297b79d726dc7fa37453ec3d.zip |
Filter unsupported words from nn.wl for cleaner compilation.
Diffstat (limited to 'README')
-rw-r--r-- | README | 14 |
1 files changed, 11 insertions, 3 deletions
@@ -1,7 +1,15 @@ This is a GNU aspell dictionary for Nynorsk. The wordlist is adopted -unchanged from the Norwegian Language Bank, licenced under CC BY -4.0[1], and may be downloaded from the National Library of Norway's -website[2]. +from the Norwegian Language Bank, licenced under CC BY 4.0[1], and may +be downloaded from the National Library of Norway's website[2]. It has +been modified to remove words and phrases unsupported by aspell. The +file nn.wl is produced from the Language Bank's fullformer_2012.txt as +follows: + +cat norsk_ordbank/fullformer_2012.txt | iconv -f ISO-8859-1 -t UTF-8\ +| cut -f3 + +sed -E -i .bak -e '/ /d' -e '/^-/d' -e '/-$/d' -e '/[^a-zA-Z.-]/d' -e\ + '/[.-]{2,}/d' nn.wl The metadata files and configure script are adapted from Morten Bo Johansen's aspell-da[3]. |