Back to feed
Dev.to
Dev.to
5/10/2026
How We Built a Sub-200ms Multilingual Chat System Translating 100+ Languages with Our Own LLM

How We Built a Sub-200ms Multilingual Chat System Translating 100+ Languages with Our Own LLM

Short summary

A hotel chain built a custom multilingual LLM fine-tuned on hospitality data after commercial translation APIs failed on tone and domain vocabulary. The system translates guest-staff conversations in 100+ languages with sub-200ms latency, running 4–6× faster than commercial APIs across 700+ hotels. The model was trained on 18 months of curated hospitality data including hotel collateral, real conversations, and synthetic dialogue corrected by native speakers.

  • Custom iRoom LLM model outperforms commercial translation APIs on hospitality tasks by 4–6× in speed and accuracy
  • Sub-200ms latency achieved through end-to-end infrastructure control and fine-tuning on domain-specific data (100+ languages)
  • Two-year build process driven by three critical failures of commercial APIs: tone preservation, domain vocabulary, and operational constraints (pricing, rate limits)

Generated with AI, which can make mistakes.

Is this a good recommendation for you?

Explore more