Hardware
Every component engineered for seamless human interaction.
01, Sensors
High-precision 3D infrared depth sensors identify and track faces in real time. Personalized greetings, age estimation, and attention tracking, all processed on-device for maximum privacy.
02, Audio
Turn Detection algorithm detects end of speech, Noise Cancellation algorithm eliminates noise, and Voice Isolation algorithm isolates only the speaker's voice, accurately capturing conversations even in noisy environments like airports and shopping malls where even humans struggle to hear.
03, Payment
Fully integrated contactless payment terminal supports all major credit cards, debit cards, Apple Pay*, Google Pay*, and mobile wallets. PCI-DSS compliant with end-to-end encryption.
* Available only in supported countries.
04, Scanner
Dual-mode optical and radio-frequency scanner reads QR codes, barcodes (1D/2D), and RFID/NFC tags. Instant ticket validation, identity verification, and loyalty card support.
05, Output
Industrial-grade thermal printer produces tickets, receipts, boarding passes, and wristbands in under 3 seconds. Supports custom branding, QR codes on print, and automatic paper cutting.
Frame-accurate mouth movements synchronized with speech for natural, human‑like presence.
Trigger real‑time actions from conversations: show maps, fetch data, control workflows.
Personalize identity, style, and languages to match brand and context.
Robust background noise suppression for clear audio in public spaces.
Lead capture, qualification, and handoff integrations to streamline sales funnels.
Manage agents, monitor conversations, and iterate faster with a clutter‑free workspace. Everything you need is a click away, so you can build and ship in minutes.
Responses are generated and delivered in real time, even during high‑concurrency traffic. Our system is optimized for sub‑second turnaround across speech, reasoning, and voice synthesis.
Sub‑100ms turnaround for natural back‑and‑forth.
Switch languages seamlessly with automatic detection.
Prosody and tone control for human‑level nuance.
One codepath, all form factors. Optimized layouts and inputs for every screen.




AI‑powered noise reduction in real time for crystal‑clear conversations.
Connect with the tools and systems you already use. Our platform integrates seamlessly with your existing infrastructure for maximum efficiency.
Advanced indoor positioning and wayfinding technology for seamless navigation within complex environments.
Lightning-fast data sync and real-time updates across connected systems.
Seamless QR code scanning and generation for instant access and information sharing.
Intelligent call management with automated closure and follow-up capabilities.
Branded idle experiences: announcements, campaign loops, and scheduled content when the kiosk isn't in an active conversation.
Intelligent sales management and customer relationship tools for enhanced conversion rates.
Deploy across public and private spaces. Designed for clarity, reliability, and scale.
Patient guidance, triage support, multilingual check-in.
Concierge, bookings, local tips, 24/7 self-service.
KYC, queue management, product guidance.
Wayfinding, live status, multilingual passenger assist.
Aisle guidance, stock checks, offers & membership.
Campus navigation, info desks, admissions help.
Shift handover, safety info, digital SOPs.
Exhibit guidance, tickets, multilingual tours.
Seat finding, live updates, crowd info.
Deployment options
Choose the rollout model that fits your security, latency, and data‑residency requirements.
Cloud
Fastest time‑to‑value with managed updates, scaling, and monitoring—ideal for distributed fleets.
On‑Prem
Run inside your own network for full control, strict data residency, and air‑gapped deployments when required.
Select from our ready avatars or bring your own. Ultra‑smooth and multilingual.
Three simple steps. Clean and fast. No complex setup required.
Start with a single clear face photo or pick from our stock library. No file upload needed now.
Natural, multilingual voices. Select tone and pace that fit your brand.
Generate a lifelike avatar in seconds. Perfect lip‑sync, realistic motion.
Pick the runtime your avatar will use.
From idea to interface in a few lines of code.
import { createClient } from '@selam-ai/js-sdk';const client = createClient('<YOUR_SESSION_TOKEN>');client.stream();
APIs
Text to Speech, Speech to Text, and Voice Cancellation models built for low latency at global scale. Drop-in primitives for noisy environments, multilingual users, and always-on kiosks.
Text to Speech
Natural, multilingual voices with consistent tone and streaming output for real-time interactions.
import { createClient } from '@selam-ai/js-sdk';const client = createClient('<YOUR_SESSION_TOKEN>');const audio = await client.tts.synthesize({text: 'Hello from Selam.AI',voice: 'alloy',format: 'mp3',});
Speech to Text
Accurate transcription with low latency and robust handling for real-world audio in public spaces.

Voice Cancellation
Noise suppression and voice isolation tuned for kiosks—clear speech in crowded, high-noise environments.
import { createClient } from '@selam-ai/js-sdk';const client = createClient('<YOUR_SESSION_TOKEN>');const cleaned = await client.vcm.process({audioTrack: micTrack,mode: 'kiosk',});
Responses are generated and delivered in real time, even during high‑concurrency traffic. Our system is optimized for sub‑second turnaround across speech, reasoning, and voice synthesis.
All data, both in transit and at rest, is protected with state‑of‑the‑art encryption and strict isolation to ensure complete security and prevent unauthorized access. Runs 100% on your own servers-even in fully internet‑isolated environments-so you stay in complete control. Powered entirely by our own proprietary LLM, STT, and TTS models with no dependency on any third‑party AI service.
Deterministic pipelines prevent unexpected states; consistency is guaranteed across sessions and environments, operating predictably even under bursty, real‑time workloads.
Policy‑bound responses with strict safeguards for enterprise reliability.
Our references
Trusted by enterprises worldwide. Deployed across airports, retail, hospitality, and telecom at global scale. Built for reliability, security, and real-time performance.





Answers to the most common questions.
Responses stream instantly and complete in sub‑second time for speech, reasoning and synthesis under typical loads.
We support 30+ languages out of the box and can extend to new locales on request.
Yes. We provide simple SDKs and webhooks; common CRMs, analytics and auth providers work seamlessly.
All data is encrypted in transit and at rest with strict isolation, audit logging and role‑based access.
Yes, you can create custom avatars using your own images or videos. Our platform makes it easy to generate personalized, professional avatars.
We offer 24/7 technical support, comprehensive documentation, and dedicated account managers for enterprise customers.
We offer guided evaluations-experience the product in our offices or via a secure online session with our team.