summaryrefslogtreecommitdiff
path: root/README
diff options
context:
space:
mode:
Diffstat (limited to 'README')
-rw-r--r--README14
1 files changed, 11 insertions, 3 deletions
diff --git a/README b/README
index 60d9e94..d147b6d 100644
--- a/README
+++ b/README
@@ -1,7 +1,15 @@
This is a GNU aspell dictionary for Nynorsk. The wordlist is adopted
-unchanged from the Norwegian Language Bank, licenced under CC BY
-4.0[1], and may be downloaded from the National Library of Norway's
-website[2].
+from the Norwegian Language Bank, licenced under CC BY 4.0[1], and may
+be downloaded from the National Library of Norway's website[2]. It has
+been modified to remove words and phrases unsupported by aspell. The
+file nn.wl is produced from the Language Bank's fullformer_2012.txt as
+follows:
+
+cat norsk_ordbank/fullformer_2012.txt | iconv -f ISO-8859-1 -t UTF-8\
+| cut -f3
+
+sed -E -i .bak -e '/ /d' -e '/^-/d' -e '/-$/d' -e '/[^a-zA-Z.-]/d' -e\
+ '/[.-]{2,}/d' nn.wl
The metadata files and configure script are adapted from Morten Bo
Johansen's aspell-da[3].