Language barriers cost gaming communities millions of shared moments every day. Miwa eliminates the gap — per-speaker Discord voice translation with romaji, a three-agent CrewAI suggestion pipeline, and style-matched LLM refinement, targeting under 800ms on AMD MI300X.
The Miwa overlay sits transparently above your game or browser. Per-speaker cards appear as each person talks, fading out when they go silent.
Full end-to-end walkthrough recorded live on AMD MI300X hardware.
| RTX 5090 — Consumer | NVIDIA H100 — Data center | AMD MI300X ✓ Miwa | |
|---|---|---|---|
| VRAM | 32 GB GDDR7X | 80 GB HBM2e | 192 GB HBM3 |
| Llama 3.3 70B | INT4 only — quality loss | FP8/INT8 — barely fits | Full FP16, single GPU |
| Bandwidth | 1.79 TB/s | 3.35 TB/s | 5.3 TB/s |
| Multi-GPU | Required for 70B | Often needed | Not needed |
| Ecosystem | CUDA only | CUDA only | ROCm — open source |
--gpu-memory-utilization 0.80, reserving ~38 GB for Whisper. The default 100% leaves zero VRAM for STT and fails silently at runtime.