Releases: aymara/lima-models

Segmentation and Tagging - multi-objective

28 Jul 17:36

This is a partial release of Deeplima models. It includes segmentation and tagging models, as well as a few lemmatization models.
The models were trained with a multi-objective optimization strategy, and the best-performing ones for tokens, UPOS tags and lemmas were selected.

Tagging models

03 Jan 23:27
f4d543d
Pre-release
v0.1.6-beta-morph

Update README.md

Embeddings for English (1/10 of C4)

20 Jul 22:47
f4d543d
Pre-release
v0.1.6-beta-en-c4medium

Update README.md

Tokenization models

21 Nov 22:05
f4d543d
Pre-release

Tokenization-only models (without sentence segmentation), trained on all trainable corpora from the UD 2.8 collection (67 languages), with 8 models per corpus.
Deeplima commit: aymara/lima@40ab9ad
LSTM hidden state: 8

v0.1.5

06 Aug 09:37
f4d543d
  • lemmatization models updated
  • speed improved
  • Albanian language added

Models for 60+ languages

04 Feb 12:00
Pre-release
v0.1.4-beta

Fix path to copyright

MorphoSyntax models for English and French

16 Sep 11:05

Starting with this version, models are packaged per language.

v0.1.2-beta

30 Aug 13:57
Pre-release

Models are installed to /usr/.

Tokenization models for English

14 Jun 13:22
Pre-release
v0.1.1-beta

English tokenization models.