Senior Software Engineer (AI)
Job description
TL;DR
Senior Software Engineer (AI) (Python/FastAPI): Building and scaling an agent-powered AI platform with an emphasis on multi-agent workflows, RAG implementations, and model-agnostic routing. Focus on designing async REST/WebSocket APIs, implementing tool-using agents, and optimizing Postgres for high-concurrency access.
Location: Remote (Quebec, Canada). Must have a daily overlap of 09:00 – 13:00 EST.
Company
A leading web technology company serving millions of customers globally through brands like Bluehost, HostGator, and Web.com.
What you will do
- Design and scale async REST/WebSocket APIs using Python 3.11+ and FastAPI with clean vertical-slice architecture.
- Implement multi-agent workflows with Semantic Kernel to route traffic among specialized LLM agents.
- Integrate LLM providers (OpenAI, Google Gemini) behind a provider-agnostic layer for A/B and cost-aware routing.
- Deliver Retrieval-Augmented Generation (RAG) using vector stores such as Azure AI Search, pgvector, or Chroma.
- Evolve schemas with SQLModel/SQLAlchemy 2 and tune Postgres for high-concurrency async access.
- Maintain robust CI/CD pipelines using Bitbucket and Jenkins to deploy Dockerized services.
Requirements
- 5+ years of experience building production APIs in Python and 2+ years with FastAPI (or similar async stack).
- Deep knowledge of async I/O, Pydantic v2, dependency injection, and observability.
- Hands-on experience with Semantic Kernel or comparable agent frameworks.
- Practical RAG implementations using Azure AI Search, pgvector, or Chroma.
- Strong Postgres skills, including SQLModel/SQLAlchemy 2 and Alembic migrations.
- Remote work readiness with daily overlap of at least 09:00 – 13:00 EST.
Nice to have
- Experience with event/message queues such as RabbitMQ, Azure Service Bus, or Kafka.
- Knowledge of observability stacks like Grafana or LangFuse for LLM cost governance.
Culture & Benefits
- Opportunity to build the agent-powered backbone of a platform serving millions of users.
- AI-assisted development environment utilizing GitHub Copilot and Cursor.
- High-performance engineering standards with a focus on low latency (p95 < 100 ms).