Dataset
The roots of Remède…
The data
folder is destined to the linguistics resources used by Remède.
- Folder
fr/
words.txt
: List of ~1 000 000 words, semi separatedipa.json
: For a key ‘word’, returns his IPA- Generated from
data/IPA.txt
: a text file of format[word]\t[ipa]
byscripts/pre_generate_ressources.py
- Generated from
- The sames resources, for each locale are situated in the folders
en
ect…
The
data/remede.db
file is not included in git files, see Setup to download it.
data/remede.schema.json
: JSON schema of Remède documentdata/custom_words.json
: File to add custom words… The words documents must follow the Remède Document Schemadata/custom_words.schema.json
: its JSON schema
data/remede.db
: A sqlite database (reference) (french) data/remede.[locale].db
: A sqlite database (reference) (for locale, eg remede.en.db
) data/drime.db
: The Open Lexicon french database rewrote by the project drime. Useful to get precise word metadata like syllables.