Together AI

公式採用ページ

Machine Learning Engineer - Inference

0万円〜 0万円

San Francisco

正社員・契約社員

経験年数：

仕事内容

<h3><strong>About the Role</strong></h3> <p>Together AI is seeking a Machine Learning Engineer to join our<strong> </strong>Inference Engine team, focusing on optimizing and enhancing the performance of our AI inference systems. This role involves working with state-of-the-art large language models models and ensuring they run efficiently and effectively at scale. If you are passionate about AI inference, PyTorch, and developing high-performance systems, we want to hear from you. This position offers the chance to collaborate closely with AI researchers and engineers to create cutting-edge AI solutions. Join us in shaping the future at Together AI!</p> <h3><strong>Responsibilities</strong></h3> <ul> <li>Design and build the production systems that power the Together AI inference engine, enabling reliability and performance at scale.</li> <li>Develop and optimize runtime inference services for large-scale AI applications.</li> <li>Collaborate with researchers, engineers, product managers, and designers to bring new features and research capabilities to the world.</li> <li>Conduct design and code reviews to ensure high standards of quality.</li> <li>Create services, tools, and developer documentation to support the inference engine.</li> <li>Implement robust and fault-tolerant systems for data ingestion and processing.</li> </ul> <h3><strong>Requirements</strong></h3> <ul> <li>3+ years of experience writing high-performance, well-tested, production-quality code.</li> <li>Proficiency with Python and PyTorch.</li> <li>Demonstrated experience in building high performance libraries and tooling.</li> <li>Excellent understanding of low-level operating systems concepts including multi-threading, memory management, networking, storage, performance, and scale.</li> <li>Preferred: Knowledge of existing AI inference systems such as TGI, vLLM, TensorRT-LLM, Optimum</li> <li>Preferred: Knowledge of AI inference techniques such as speculative decoding.</li> <li>Preferred: Knowledge

必須要件

求めるスキル

Python PyTorch CUDA LLM Rust

勤務条件

勤務時間
雇用形態	正社員・契約社員
勤務地	San Francisco
リモートワーク	不可

Together AI 公式採用ページ掲載求人

この求人に応募する

11日前に掲載

公式ページで応募する

※ 企業の公式採用ページへ移動します

Together AI

Machine Learning Engineer - Inference

仕事内容

必須要件

求めるスキル

勤務条件

この求人に応募する

人気求人

おすすめコンテンツ

Together AI

Machine Learning Engineer - Inference

仕事内容

必須要件

求めるスキル

勤務条件

この求人に応募する

人気求人

おすすめコンテンツ

メールアドレスで無料会員登録

求職者ログイン

掲載企業様の方はこちら

企業様 新規登録

企業ログイン

求職者の方はこちら

パスワードリセット

企業様 パスワードリセット

新しいパスワードを設定

Cookieの使用について

Cookie設定

必須Cookie

分析Cookie

機能Cookie

企業様新規登録

企業様パスワードリセット