Staff Systems Engineer (Cloud Operations & Support)
Job description
TL;DR
Staff Systems Engineer (Cloud Operations & Support) (HPC/Cloud): Architecting and managing high-performance CPU and GPU clusters for cloud infrastructure, with an emphasis on operational strategy, performance tuning, and multi-tenant service optimization. Focus on designing scalable HPC environments, automating system maintenance, and ensuring high availability across global sites.
Location: Must be based in the US (Remote, San Jose, CA, or Austin, TX)
Company
The company is a global leader in electronic design automation (EDA) software and hardware.
What you will do
- Architect, build, and optimize high-performance CPU and GPU clusters for the cloud.
- Deploy and manage multi-tenant cloud services across both private and public infrastructure.
- Drive the overall operational strategy for internal HPC clusters to improve efficiency and reporting.
- Collaborate with engineering teams to develop and implement solutions that optimize their working environment.
- Develop automation scripts using Python, Bash, or Perl to streamline deployment and maintenance.
- Implement monitoring solutions for system health, GPU utilization, and container performance.
Requirements
- 8+ years of technical experience architecting and managing Linux-based HPC environments.
- 3+ years of experience coordinating support and operations across multiple global geographies.
- Deep expertise in Linux system administration (RHEL preferred), including networking, storage, and performance tuning.
- Extensive hands-on experience with Docker, image management, and container orchestration.
- Proven experience in GPU cluster management, including installation and optimization on OpenStack.
- Proficiency in Python, Bash, or Perl for system automation and reporting.
Nice to have
- Direct Electronic Design Automation (EDA) experience.
Culture & Benefits
- Opportunity to work in a high-impact role that drives technology leadership and innovation.
- Collaborative environment focusing on customer success and productivity.
- Flexible location options within the USA (Remote or Onsite).