Job Description
<h3><strong>About the Role</strong></h3>
<p><span style="font-size: 12pt;">Together AI is defining the infrastructure layer for the next generation of voice applications. Our Voice AI platform powers production-grade, real-time voice agents at scale — and we're looking for a Staff Platform Engineer to own the architecture that makes it possible.</span></p>
<p><span style="font-size: 12pt;">This isn't a role about maintaining what exists. You'll set the technical direction for how developers interact with Together's voice platform — from the real-time API primitives they build on, to the autoscaling systems that keep latency SLOs intact under unpredictable load, to the multi-provider abstraction layer that makes our platform uniquely powerful. Voice infrastructure is categorically harder than text inference: bidirectional audio streams, stateful long-lived connections, millisecond latency requirements, and complex multi-model routing don't forgive architectural shortcuts. You'll bring the judgment to get this right the first time, at scale.</span></p>
<p><span style="font-size: 12pt;">This is a foundational hire on a small, high-conviction team. The decisions you make in this role will define the platform architecture for years.</span></p>
<h3><strong>Responsibilities</strong></h3>
<ul>
<li><span style="font-size: 12pt;">Own the architecture and reliability of Together's real-time API layer — set the technical direction for WebSocket and HTTP streaming APIs powering STT and TTS at scale; establish the reliability bar (connection lifecycle, backpressure, graceful degradation, reconnection) that production voice agents — contact centers, AI agents, communication platforms — depend on.</span></li>
<li><span style="font-size: 12pt;">Lead autoscaling architecture for latency-sensitive voice workloads — design and ship orchestration systems that handle bursty, real-time traffic across tens of thousands of GPUs; solve the hard problems at the intersection of c
Required Skills
Python
Kubernetes
Rust
React
TypeScript