Somali lexicon & grammar

AFTA Lex — Somali lexicon and grammar engine

Linguistic infrastructure for Somali text — lexicon, morphology, grammar, and spelling services that make Somali machine-readable and searchable.

Headwords

70k+

Curated Somali entries with linguistic features and variants.

Morphology

Parsed forms

Inflections, derivations, and patterns for NLP pipelines.

Coverage

Dialects & domains

Tags for region, register, and usage context.

Lexicon & morphology engine

AFTA Lex provides the linguistic data layer needed for Somali NLP: tokenization, tagging, ranking, and contextual understanding.

  • Machine-readable lexicon with POS tags and features
  • Morphological analyzer for forms and lemmas
  • APIs for search, indexing, and language tools

Grammar & spelling services

Grammar- and spelling-aware components for editors and applications.

  • Contextual grammar hints and suggestions
  • Spell-check APIs tuned for Somali words and patterns
  • Integration hooks for writing tools and content platforms

Where it is used

Search & discovery Education platforms Government portals Research & corpus work

Built with partners

We work with linguists, educators, and cultural institutions to keep AFTA Lex aligned with real Somali usage, new terminology, and dialectal variation.

💬