Forward Deployed Engineer (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Forward Deployed Engineer (AI): Developing and deploying high-performance AI inference solutions for regional customers with an accent on pre-sales technical leadership and PoV execution. Focus on designing reference architectures, optimizing LLM inference performance, and integrating Kubernetes-based AI stacks.
Location: Must be a Singapore Citizen and based in Singapore
Company
builds high-performance AI systems, including the Rebel 100™, natively supporting PyTorch and vLLM for production inference.
What you will do
- Lead technical customer engagements from initial conversations through PoV design and execution.
- Build product demos and benchmarking content for LLM inference and vision pipelines.
- Design and prototype reference architectures, including serving stack configuration and Kubernetes integration.
- Analyze end-to-end inference performance and optimize serving and routing strategies.
- Collaborate with customer infrastructure leads and platform engineers on architecture and deployment.
- Provide customer feedback to internal engineering teams to influence product direction.
Requirements
- Bachelor’s degree in Computer Science, Electrical Engineering, or a related field.
- Must be a Singapore Citizen.
- Minimum 3 years of experience in AI/ML systems deployment or solutions engineering.
- Hands-on experience with Kubernetes-based AI inference stacks and orchestration.
- Proficiency in Python and PyTorch.
- Fluent English communication skills.
Nice to have
- Experience in pre-sales roles such as Solutions Engineer or Sales Engineer.
- Knowledge of GPU infrastructure (NVIDIA DGX/HGX, NVLink, InfiniBand).
- Understanding of hardware acceleration (NPU, GPU) and model optimization techniques like quantization.
- Experience producing technical content including blogs and whitepapers.
Culture & Benefits
- Opportunity to work as part of a small, fast-moving regional team.
- Exposure to cutting-edge AI hardware and full-stack software.
- High degree of autonomy in representing the company at industry events and conferences.
Hiring process
- Application Review.
- Online Interview.
- On-site Interview including a technical exercise.
- Culture-Fit Interview.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →