Integrations
CallScribe for WhatsApp Business
WhatsApp voice notes in Arabic — transcribed via the WhatsApp Business Cloud API.
Last updated: April 2026
WhatsApp is how a large fraction of GCC business communication actually happens. Customer enquiries arrive as WhatsApp voice notes; sales follow-ups are voice messages; support escalations come in as 90-second audio rants. None of that lives in your CRM unless someone manually transcribes it. CallScribe integrates with the WhatsApp Business Cloud API to transcribe inbound (and optionally outbound) voice notes automatically and post the text into your workspace.
Where WhatsApp voice notes actually originate
In the GCC, WhatsApp is the default informal-business channel. A potential customer asks a sales question via voice note; a property tenant complains via voice note to a real-estate agency; a clinic patient describes symptoms in dialect. The audio is short (15 to 90 seconds typically), conversational, and almost always dialectal. None of the voice notes get transcribed by default — they sit in WhatsApp threads that one person can read, slowing every downstream operational process.
How the integration works (Cloud API vs. On-Premises API)
Meta provides two flavours of the WhatsApp Business API: Cloud API (Meta-hosted, the standard for new deployments) and the older On-Premises Business API (deprecated, sunsetting in 2025). CallScribe integrates with the Cloud API. You configure your Business Phone Number's webhook to point at CallScribe's endpoint. Inbound messages — including audio media IDs — are delivered to us; we GET the audio file via the Media endpoint, transcribe with our Arabic dialect-aware pipeline, and post the transcript back into a CallScribe project tied to the Business Phone Number.
For organisations using a Business Solution Provider (BSP) — Twilio, MessageBird, 360dialog, Infobip, Vonage — CallScribe integrates via the BSP's webhook forwarding rather than direct WhatsApp Cloud API. The end result is the same.
Audio retention rules and the 30-day window
Meta retains media (audio voice notes included) for 30 days from message receipt. After 30 days, the media-ID returns 410 Gone. CallScribe fetches and transcribes within minutes of receipt, so the 30-day window matters only for retroactive bulk imports — if you want to import six months of WhatsApp voice notes that already happened, the audio is mostly gone and only metadata remains.
CallScribe stores the transcript per its own retention rule. By default we do not retain the original audio file; if you need original audio retention beyond Meta's 30 days, enable original-audio storage on the project.
PDPL, customer consent, and disclosure
Voice notes are personal data. Under UAE PDPL and KSA PDPL, lawful basis for processing must be established. The standard practice is to disclose recording/transcription in your business's WhatsApp opt-in flow and in the WhatsApp Business Profile. CallScribe does not handle disclosure for you — that is on the customer-facing business; we sit downstream as a data processor under your DPA.
Use cases that pay back fast
Property-management voice notes from tenants get parsed into a complaint ticket with category and severity. Sales-team voice-note enquiries get auto-extracted into a CRM lead with intent classification. Customer-support voice-note tickets get prioritised by sentiment severity. Each of these saves a person from listening to every voice note linearly. The volume varies wildly: small businesses handle 10-50 voice notes per day; large hotel groups or property platforms handle thousands.
Audio quality and Arabic accuracy on WhatsApp
WhatsApp voice notes are encoded in OGG/Opus at typically 16kHz, 32kbps. Audio quality is generally fine for ASR — better than telephony — but background noise (street, café, car) is more variable than recorded business calls. WER on clear single-speaker Khaleeji voice notes is typically 9-13%; noisy outdoor recordings can rise to 18-25% but the model degrades gracefully rather than producing nonsense.
At a glance
- ✓Cloud API and BSP-mediated integration
- ✓30-day media-retrieval window — fetched within minutes
- ✓OGG/Opus voice notes handled natively
- ✓Per-business-number isolation in CallScribe projects
- ✓Inbound + optional outbound voice-note transcription
FAQs
Do I need to be a Meta Business verified entity?▾
Yes — using the WhatsApp Business Cloud API requires a verified Meta Business Account and an approved WhatsApp Business Phone Number. CallScribe does not handle that approval — it's a Meta process. Most active GCC SMBs already have one because WhatsApp is the de facto channel.
Can CallScribe transcribe outbound voice notes too?▾
Yes if you opt in. Outbound voice notes (sent by your reps to customers) are forwarded to CallScribe via the same webhook; transcription is mostly useful for QA — was the rep on-message, did they make any compliance-relevant claims.
Does CallScribe send replies back via WhatsApp?▾
No — CallScribe is read-only on WhatsApp. Replies are sent by your customer-platform (the BSP, Twilio, Infobip, or your direct WhatsApp Business app). We provide the transcript and analytics; the messaging side is not our domain.
How does CallScribe handle WhatsApp voice notes versus regular calls?▾
They are treated as separate audio sources but use the same Arabic dialect-aware ASR pipeline. Voice notes are typically shorter and noisier than recorded calls; sentiment scoring works the same way; diarization is moot because voice notes are single-speaker.
What happens if a voice note is in a language CallScribe doesn't cover?▾
CallScribe primarily covers Arabic and English. Hindi, Urdu, Tagalog, Tamil voice notes are language-detected at ingest and either passed through to a partner ASR provider (configurable) or returned as untranscribed with a language-detected metadata flag.
How is this priced?▾
Voice-note minutes count against your CallScribe plan minutes the same as call minutes. A typical small-business pattern of 30 voice notes per day at 45 seconds each is roughly 22.5 minutes per day or 670 minutes per month — fitting Scale ($79/mo, 3,000 min). Larger operations should plan for enterprise pricing.
5 min/mo free · No credit card · 8-12% WER on Khaleeji