DATA & LICENSING

Data & licensing for AFTA AI Somali speech & text

AFTA AI curates Somali speech and text datasets that power AFTA Voices, AFTA Lex, and AFTA Data Studio. These corpora are proprietary and made available under research and commercial licenses.

This page explains who can access the datasets, which license applies, and how to request access.

01 • Overview

AFTA AI Somali Speech & Text Corpus

A unified set of Somali audio, transcripts, lexicon entries, and linguistic annotations used to build modern Somali ASR, TTS, and NLP systems.

  • Speech recordings with aligned transcripts
  • Lexicon & morphology tables for AFTA Lex
  • Clean corpora for search, translation, and evaluation

All datasets remain the intellectual property of AFTA AI and are not in the public domain.

02 • Research access

Research-only license (SASR-L)

Universities, labs, and non-profit institutions may request access under the AFTA AI Somali Speech & Text Dataset License (SASR-L).

  • Non-commercial research and evaluation only
  • No redistribution or uploading to public repositories
  • No training of competing Somali foundation models
  • Attribution to AFTA AI in publications

The full legal terms are defined in the SASR-L license.

View Dataset License

03 • Commercial licensing

Commercial usage & model training

For production systems, commercial APIs, or large-scale model training on Somali data, AFTA AI offers tailored commercial licenses.

  • Rights to train or fine-tune ASR, TTS, and LLMs
  • Deployment in products, call centers, and platforms
  • Optional private cloud or on-prem dataset hosting
  • Tiered pricing based on scale and usage

Commercial terms are negotiated per project. Pricing is available on request.

Contact enterprise licensing

04 • How to request

How to request dataset access

To begin, submit a dataset access request with details about your institution, project, and intended use. Our team will review your application and respond with the appropriate license.

  • Fill out the dataset access request form
  • Confirm whether use is research-only or commercial
  • Agree to SASR-L or negotiate a commercial agreement
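
For teams that want to pre-screen a request before submitting it, the choice between the two license tracks can be sketched in code. This is an illustration only: the AccessRequest record, its field names, and the likely_license helper are hypothetical and are not part of any AFTA AI form or API. It assumes the eligibility rules stated on this page (universities, labs, and non-profits doing non-commercial work fall under SASR-L; everything else is commercial). AFTA AI's review decides the actual license.

```python
from dataclasses import dataclass

# Institution categories named in the research-access section of this page.
# The exact strings are an assumption made for this sketch.
RESEARCH_INSTITUTIONS = {"university", "lab", "non-profit"}


@dataclass
class AccessRequest:
    """Illustrative request record; field names are hypothetical."""
    institution: str
    institution_type: str  # e.g. "university", "lab", "non-profit", "company"
    project: str
    commercial_use: bool   # True for products, commercial APIs, production training


def likely_license(request: AccessRequest) -> str:
    """Return the license track a request would probably fall under."""
    is_research_body = request.institution_type.lower() in RESEARCH_INSTITUTIONS
    # SASR-L requires both an eligible institution and non-commercial use.
    if is_research_body and not request.commercial_use:
        return "SASR-L"
    return "commercial"
```

A university evaluating an ASR model with no commercial use would map to "SASR-L" here, while the same university building a product on the data would map to "commercial".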

For joint research projects or strategic partnerships, contact data@aftaai.com.

For privacy and processing details, see the Privacy Policy. For general questions, contact info@aftaai.com.