📦Dataset

The roots of Remède...

The data folder is destined to the linguistics resources used by Remède.

Folder fr/
- words.txt: List of ~1 000 000 words, semi separated
- ipa.json: For a key 'word', returns his IPA
  - Generated from data/IPA.txt: a text file of format [word]\t[ipa] by scripts/pre_generate_ressources.py
The sames resources, for each locale are situated in the folders en ect...

The data/remede.db file is not included in git files, see Setup to download it.

data/remede.schema.json: JSON schema of Remède document
data/custom_words.json: File to add custom words... The words documents must follow the Remède Document Schema
- data/custom_words.schema.json: its JSON schema
data/remede.db: A sqlite database (reference) (french)
data/remede.[locale].db: A sqlite database (reference) (for locale, eg remede.en.db)
data/drime.db: The Open Lexicon french database rewrote by the project drime. Useful to get precise word metadata like syllables.

Last updated 1 year ago

Was this helpful?