For enterprise

Voice AI that works in the languages your customers actually speak.

ASR, TTS, and call-centre intelligence APIs for Nigerian English, Pidgin, Yoruba, Hausa, and Igbo — built on ethically-sourced cooperative data with a documented consent chain.

Why existing solutions fail in Nigeria

01

Whisper and Google fail on Nigerian indigenous languages

Whisper large-v3 has a word error rate of approximately 55% on Igbo. Google Speech-to-Text does not support Yoruba or Hausa. AWS Transcribe does not support any Nigerian indigenous language. For voice AI products serving Nigerian users, this is the difference between usable and unusable.

02

Existing African language datasets cannot be used commercially

Most African language speech data is research-tier: collected for academic use, with no documented contributor consent for commercial AI training. Enterprise legal teams cannot approve it, and your AI ethics audit will flag it. We built the consent chain that makes commercial use possible.

03

There is no domain-specific data for the use cases that matter

A bank needs financial-services speech data. A telco needs customer-service dialogue. A health programme needs primary health care (PHC) interactions. Generic speech corpora do not deliver this. Africa's Voice collects speech across ten domains per language, with explicit metadata tagging.

What we offer

ASR API

From $0.003 / audio minute

Real-time and async transcription endpoints. Five Nigerian languages at launch. SFTP drop integration available alongside REST. SLA-backed for enterprise pilots.

  • 5 Nigerian languages at launch
  • Real-time + async endpoints
  • Code-switching tolerant
  • SFTP drop integration available

Datasets

From $5,000 per dataset

Validated speech corpora with paired human-verified transcripts, domain tags, and full consent provenance. Research tier for academic use; commercial tier for AI training.

  • Audio + paired transcripts
  • Domain-tagged metadata
  • NDPA 2023 compliant
  • Tiered: research / commercial

Enterprise CCI

From $2,000 / month

Call Centre Intelligence: transcription of 100% of calls, sentiment scoring, and compliance flagging. Designed for CBN-regulated banks and telcos with disclosure requirements.

  • Full call transcription
  • Compliance flagging
  • Multi-language quality scoring
  • On-premise deployment available

The differentiator

The consent and governance story

Every audio file in our corpus has a documented consent record. Contributors are members of the Africa's Voice Data Cooperative. They consented at one of three tiers — research only, commercial use, or public domain. They can withdraw consent at any time. The audit chain is queryable per file. This is what your AI ethics audit needs to see — and what most African language datasets cannot provide.
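As an illustration only, a per-file consent record with the three tiers and a withdrawal marker could be modelled as below. Every name here (`ConsentTier`, `ConsentRecord`, `permits_commercial_training`, and all field names) is hypothetical and is not the actual Africa's Voice schema. The sketch also assumes that public-domain consent covers commercial use.

```python
from dataclasses import dataclass
from datetime import datetime
from enum import Enum
from typing import Optional


class ConsentTier(Enum):
    # The three tiers named on this page; the string values are illustrative.
    RESEARCH_ONLY = "research_only"
    COMMERCIAL = "commercial"
    PUBLIC_DOMAIN = "public_domain"


@dataclass(frozen=True)
class ConsentRecord:
    # Hypothetical fields for one audio file's consent entry.
    file_id: str
    contributor_id: str
    tier: ConsentTier
    consented_at: datetime
    withdrawn_at: Optional[datetime] = None  # set once consent is withdrawn


def permits_commercial_training(record: ConsentRecord) -> bool:
    """Return True if this file may be used for commercial AI training."""
    # Withdrawn consent blocks every use, regardless of tier.
    if record.withdrawn_at is not None:
        return False
    # Assumption: public-domain consent also covers commercial use.
    return record.tier in (ConsentTier.COMMERCIAL, ConsentTier.PUBLIC_DOMAIN)
```

A check like this would run per file before a dataset is assembled, so a withdrawal takes effect on the next build.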

API preview

POST /v0/transcribe
Authorization: Bearer {api_key}
Content-Type: audio/wav

→  Response:
{
  "language_detected": "ibo",
  "language_confidence": 0.94,
  "transcript": "Kọọ ihe i mere n'oge ụtụtụ taa.",
  "duration_seconds": 5.2,
  "model_version": "av-asr-v0.1"
}
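
A minimal Python sketch of calling the endpoint previewed above, using only the standard library. The base URL is a placeholder, since the preview shows only the path. Sending the raw WAV bytes as the request body is an assumption inferred from the `Content-Type: audio/wav` header. The function names are illustrative and not part of an official SDK.

```python
import json
import urllib.request

# Placeholder base URL: the preview shows only the path /v0/transcribe.
BASE_URL = "https://api.example.com"

# Fields that appear in the preview response.
EXPECTED_FIELDS = {
    "language_detected",
    "language_confidence",
    "transcript",
    "duration_seconds",
    "model_version",
}


def build_transcribe_request(wav_bytes: bytes, api_key: str) -> urllib.request.Request:
    """Build a POST /v0/transcribe request with the headers from the preview."""
    return urllib.request.Request(
        url=f"{BASE_URL}/v0/transcribe",
        data=wav_bytes,  # assumption: raw WAV bytes as the request body
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "audio/wav",
        },
    )


def parse_transcribe_response(body: bytes) -> dict:
    """Decode the JSON body and check that the previewed fields are present."""
    result = json.loads(body.decode("utf-8"))
    missing = EXPECTED_FIELDS - set(result)
    if missing:
        raise ValueError(f"response is missing fields: {sorted(missing)}")
    return result


def transcribe(wav_path: str, api_key: str) -> dict:
    """Read a WAV file, send it for transcription, and return the parsed result."""
    with open(wav_path, "rb") as f:
        request = build_transcribe_request(f.read(), api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        return parse_transcribe_response(response.read())
```

With a real base URL and key, `transcribe("call.wav", api_key)["transcript"]` would return the recognised text, and `language_confidence` could be compared against a threshold to route low-confidence audio to human review.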

Discovery call

Start a discovery conversation

20 minutes. We will not pitch; we want to understand your problem first. We reply within 48 business hours.

We'll never share your information. Submissions go directly to the founding team.