Staff DevOps Engineer
Description
At Sonatus, we’re driving the transformation to AI-enabled software-defined vehicles. Traditional automotive software methods can’t keep pace with consumer expectations shaped by the mobile industry, where features evolve rapidly, update seamlessly, and improve continuously. That’s why leading OEMs trust Sonatus to accelerate this shift. Our technology is already in production across more than 6 million vehicles on the road today and rapidly expanding. Headquartered in Sunnyvale, CA, with 250+ employees worldwide, Sonatus combines the agility of a fast-growing company with the scale and impact of an established partner. Backed by strong funding and proven by global deployment, we’re solving some of the most interesting and complex challenges in the industry. Join us and help redefine what’s possible as we shape the future of mobility.
The Role
Sonatus builds the software platform for AI-enabled, software-defined vehicles. Our Korea engineering site is responsible for critical product delivery to major automotive OEMs: it generates 30% of company revenue and operates dedicated build infrastructure, lab hardware, and test environments that must run reliably on-premises.
We are seeking a Staff DevOps Engineer based in Seoul to own and modernize the Korea site's on-premises infrastructure: bare-metal Jenkins build servers, enterprise storage, lab networking, and the automation that ties CI/CD pipelines to physical test hardware. This is a hands-on engineering role where you will be the primary infrastructure owner for the Korea site, working closely with the global Engineering Operations team (Sunnyvale, Dublin) to bring consistency and modern practices to a critical production environment.
Today, the Korea infrastructure is functional but manually configured: Jenkins controllers and agents set up by hand, storage replication that is undocumented, and release automation built on large monolithic scripts.
You will modernize this into a codified, observable, maintainable platform while preserving the team's existing workflows and institutional knowledge.
What You'll Own
- On-premises build infrastructure: Own the Korea Jenkins controller and physical build agents. Manage capacity planning, server health, and reliability for the bare-metal servers that produce ECU firmware builds. Coordinate AWS overflow capacity when on-prem resources are constrained.
- Lab infrastructure and hardware integration: Own the integration between CI/CD pipelines and physical test hardware (ECUs, evaluation boards). Support board provisioning, health monitoring, and the automation that connects builds to hardware test execution. Work with QA engineers to ensure test environments are reliable and reproducible.
- Storage and replication: Bring up and operate Pure Storage (FlashArray) for build workspaces, artifact caching, and test output. Own the NFS layer and multi-site replication between Korea, Sunnyvale, and Dublin. Document and stabilize storage infrastructure that currently has single-person dependencies.
- Infrastructure as Code migration: Lead the migration from manual server configuration to codified infrastructure, using Terraform for provisioning and Ansible for configuration management. The Korea site currently has zero IaC; you will design the target state and execute the migration without disrupting active builds.
- CI/CD pipeline support: Maintain and improve the Korea release automation pipelines (Jenkins shared libraries, Groovy-based release workflows). Partner with the global CI/CD team on consolidation and standardization efforts while ensuring Korea-specific build and release requirements are met.
- Monitoring and observability: Deploy monitoring (Nagios, Prometheus/Grafana, or equivalent) for all Korea on-prem infrastructure: build servers, storage arrays, network equipment, and lab hardware. Replace the current model of "someone notices it's broken" with proactive alerting and dashboards.
- Datacenter operations: Own the physical environment for the Korea server room: coordinate with facilities on power and cooling constraints, manage rack layouts, and plan capacity for future hardware deployments.
What You Bring
Requirements
- 8+ years in infrastructure, DevOps, or SRE roles, with significant time managing on-premises and bare-metal infrastructure, not exclusively cloud-native environments
- Linux systems administration: You debug production servers over SSH daily: networking, storage, process management, systemd, log analysis
- On-premises infrastructure: Hands-on experience with physical servers, rack management, storage arrays (NFS/SAN), and datacenter operations including power, cooling, and capacity planning
- CI/CD platform administration: Experience operating Jenkins at scale (distributed agents, shared libraries, credential management) or equivalent enterprise CI/CD platform
- Infrastructure as Code: Practical experience with Terraform and Ansible (or equivalent). You've migrated environments from manual to codified, not just maintained existing IaC
- Networking: Working knowledge of VLANs, DNS, VPN/site-to-site connectivity, and how lab networks integrate with cloud environments
- Scripting and automation: Proficiency in Python and Bash. Comfortable working in large Groovy codebases
- Storage administration: Experience with enterprise storage (Pure Storage, NetApp, or similar), NFS configuration, and backup/replication strategies
Nice to Have
- Embedded systems build toolchains (Bazel, Yocto, CMake cross-compilation)
- Virtualization technologies (VMware vSphere, KVM)
- Hardware-in-the-loop or automotive test infrastructure experience
- Security scanning integration (Coverity, BlackDuck, or similar SAST/SCA tools)
- Experience with Kubernetes in hybrid on-prem/cloud environments
- Korean language proficiency (business conducted in both Korean and English)
Your First 90 Days
Days 1–30: Learn and Stabilize
- Gain operational access to all Korea Jenkins infrastructure: controller, build agents, credentials, and storage
- Map every piece of on-prem hardware: servers, storage, network equipment, and test boards
- Complete a credential and access audit: identify and eliminate personal tokens and single-person dependencies
- Meet with the Korea engineering and QA teams to understand their build, test, and release workflows firsthand
- Document the current NAS replication setup and storage architecture
Days 31–60: Codify and Instrument
- Stand up the first Terraform-managed infrastructure component for the Korea site
- Begin Ansible-based configuration management for build agents, replacing manual setup
- Deploy monitoring for all Korea on-prem infrastructure (servers, storage, lab hardware)
- Support Pure Storage bringup (pending HVAC resolution) and NFS volume configuration
- Produce a written infrastructure assessment and gap analysis for the Korea site
Days 61–90: Modernize and Scale
- Establish a repeatable, codified agent provisioning process aligned with the global standard
- Define storage policies for build workspaces and artifacts on Pure Storage
- Begin modernizing release automation scripts toward maintainable, testable pipeline code
- Present a 6-month Korea infrastructure roadmap to engineering leadership
About Sonatus
Sonatus is building the operating system for the software-defined vehicle. Our platform enables automakers to deploy, update, and manage vehicle software at scale. Our technology is already in production across more than 6 million vehicles on the road today. The Korea site is a critical hub for OEM delivery, and the infrastructure you own will directly enable the teams building and testing firmware for production automotive hardware.
Logistics
Location
Seoul, South Korea — on-site required. This role requires physical access to lab and datacenter hardware.
Travel
Occasional travel to Sunnyvale and Dublin lab sites (estimated 1–2 trips per year).