2 days ago

Senior AI/LLM Engineer (Python)

$8,500
Work format: remote (Global)
Employment type: full-time
Grade: senior
English: B2
Vacancy from a Telegram channel -

Job description

#llm #python #remote

Senior AI/LLM Engineer (Architecture, LLM, Python - MUST)

Salary: up to 8,500 USD gross

Remote, full-time, long-term B2B contract. The candidate's location, legal entity, and bank account must all be outside Russia, Belarus, and Turkey.



HiO is an AI Business Partner for small and medium business owners, unifying customer conversations from every platform into a single, intelligent co-worker that proactively surfaces growth opportunities and flags what actually needs the owner’s attention. The market demand for this kind of intelligent automation is explosive, but executing it flawlessly is a deep technical challenge. We aren't just building a simple LLM wrapper; we are architecting a complex, highly reliable system capable of orchestrating sophisticated agentic workflows at scale. To meet this massive market pull, we’ve raised a $16M seed round from the visionary VCs behind OpenAI and Anthropic, alongside Wix as a strategic investor. We’re a high-talent, fast-moving team building the future of how small businesses thrive, and we’re just getting started.
 
About the Role
We’re building agentic AI for real-world CX. As an AI Engineer at HiO, you’ll craft scalable, low-latency AI systems - from LLM orchestration to real-time APIs - using Python, LangGraph, and modern cloud stacks. Fast-paced, high-impact, and deeply hands-on.
 
What You’ll Do
· Design, build, and maintain real-time AI-powered APIs and services in Python.
· Architect, optimize, and deploy LLM workflows leveraging LangGraph or similar orchestration frameworks.
· Develop agentic logic and reasoning pipelines for conversational and task-based automation.
· Build scalable infrastructure to support multi-model, low-latency, and streaming AI experiences.
· Collaborate with backend and product teams to translate complex customer workflows into reliable AI systems.
· Ensure high performance, reliability, and data security across production workloads.
· Deploy and manage applications on modern cloud environments (AWS / GCP).
· Implement monitoring, logging, and fault-tolerance mechanisms for production-grade AI systems.
· Contribute to architectural and product decisions, balancing innovation, safety, and scalability.
 
What We’re Looking For
· BSc or higher in Computer Science, Engineering, or a related technical field.
· 7+ years of production development experience.
· 5+ years of experience in ML engineering, building AI/LLM-based systems in production.
· Deep experience with Python (async, multiprocessing, decorators, type hints, debugging).
· Proven hands-on work with LLMs (3+ years), including streaming inference, prompt management, or multi-agent systems.
· Experience with LangGraph, LangChain, or similar orchestration frameworks.
· Strong experience with MLOps, microservices, and distributed architectures.
· Solid grasp of SQL/NoSQL databases, message queues, and scalable data design.
· Experience deploying applications to AWS, GCP, or similar cloud platforms.
· Ownership mindset - comfortable operating across the stack and figuring things out fast.
· Excited to work in a fast-paced startup environment where priorities shift quickly and impact is immediate.
· B2 English.