DATA & LICENSING
Data & licensing for AFTA AI Somali speech & text
AFTA AI curates Somali speech and text datasets that power AFTA Voices, AFTA Lex, and AFTA Data Studio.
These corpora are proprietary and are made available only under research or commercial licenses.
This page explains who can access the datasets, which license applies, and how to request access.
01 • Overview
AFTA AI Somali Speech & Text Corpus
A unified set of Somali audio, transcripts, lexicon entries, and linguistic annotations used to build
modern Somali ASR, TTS, and NLP systems.
- Speech recordings with aligned transcripts
- Lexicon & morphology tables for AFTA Lex
- Clean corpora for search, translation, and evaluation
All datasets remain the intellectual property of AFTA AI and are not in the public domain.
02 • Research access
Research-only license (SASR-L)
Universities, labs, and non-profit institutions may request access under the
AFTA AI Somali Speech & Text Dataset License — SASR-L.
- Non-commercial research and evaluation only
- No redistribution or uploading to public repositories
- No training of competing Somali foundation models
- Attribution to AFTA AI in publications
The full legal terms are defined in the SASR-L license.
View Dataset License
03 • Commercial licensing
Commercial usage & model training
For production systems, commercial APIs, or large-scale model training on Somali data,
AFTA AI offers tailored commercial licenses.
- Rights to train or fine-tune ASR, TTS, and LLMs
- Deployment in products, call centers, and platforms
- Optional private cloud or on-prem dataset hosting
- Tiered pricing based on scale and usage
Commercial terms are negotiated per project. Pricing is available on request.
Contact enterprise licensing
04 • How to request
How to request dataset access
To begin, submit a dataset access request with details about your institution, project, and intended use.
Our team will review your application and respond with the license that applies to your use.
- Fill out the dataset access request form
- Confirm whether use is research-only or commercial
- Agree to SASR-L or negotiate a commercial agreement
For joint research projects or strategic partnerships, contact data@aftaai.com.
For privacy and processing details, see the Privacy Policy.
For general questions, contact info@aftaai.com.