仕事内容
<div class="content-intro"><h2><strong>Who We Are</strong></h2>
<p>Lightning AI is the company behind PyTorch Lightning. Founded in 2019, we build an end-to-end platform for developing, training, and deploying AI systems—designed to take ideas from research to production with less friction.</p>
<p>Through our merger with Voltage Park, a neocloud and AI Factory, Lightning AI combines developer-first software with cost-efficient, large-scale compute. Teams get the tools they need for experimentation, training, and production inference, with security, observability, and control built in.</p>
<p>We serve solo researchers, startups, and large enterprises. Lightning AI operates globally with offices in New York City, San Francisco, Seattle, and London, and is backed by Coatue, Index Ventures, Bain Capital Ventures, and Firstminute.</p>
<div class="c-message_actions__container c-message__actions"> </div></div><h2><strong>What We're Looking For</strong></h2>
<p>Lightning AI is seeking <strong>Network Operations Center (NOC) Analysts </strong>to support 24/7 operations across select high-performance compute data centers with advanced monitoring infrastructure.<strong> </strong>This is a technically focused role centered on telemetry analysis, infrastructure monitoring, and independent diagnosis of compute, network, and hardware systems.</p>
<p>You will serve as the first line of technical response — analyzing telemetry signals, diagnosing system anomalies, and troubleshooting Linux and network-layer issues before escalating with clear, actionable findings. You will operate with a high degree of independence, applying sound judgment in situations that often extend beyond predefined runbooks.</p>
<p>This role offers a clear pathway toward positions in network engineering, site reliability, or data center operations, and the opportunity work with next-generation AI hardware and some of the most advanced compute infrastructure deployed today.</p>
<p><em>This