Мэтч & Сопровод
Покажет вашу совместимость и напишет письмо
Описание вакансии
Data Specialist
Company
VIA Science, Inc.
Conditions
13 hours agoSeniorSalary: 85K - 105KHybrid Data Science Jobs by VIA Science, Inc.
Skills
Data Quality Numpy R Airflow Dbt Visualization Postgresql Machine Learning Data Pipeline Dagster Aws Azure Pandas Sql Python Communication Documentation Data Analysis Statistics Elt Anomaly Detection Etl Ai
About the Role
You will turn raw, complex data into trusted, AI-enhanced intelligence that powers data products. You will explore and profile customer data, translate domain knowledge into data infrastructure requirements, and design end-to-end ETL/ELT pipelines with automation and self-healing. You will run automated and manual quality control, document assumptions, and build AI features such as automated insights, anomaly detection, and natural-language interfaces. You will evaluate AI/ML outputs against domain expectations, design human-in-the-loop checks, and deliver interactive analyses, reports, and visualizations to customers. You will also identify opportunities to improve internal data cleaning and quality-assessment tools and help automate repetitive data tasks.
Requirements
- 3+ years of experience in a data-driven role or equivalent in data-related research projects
- Bachelor’s or Master’s degree in science, mathematics, engineering, or a data-driven field
- Competence in Python, R, or an equivalent programming language
- Competence in at least two of: SQL/PostgreSQL, NumPy/pandas, Dagster/Airflow/dbt, AWS/Azure
- Ability to translate complex data findings into clear, compelling narratives
- Strong communication skills to decompose operational workflows into repeatable steps
- Passion for data integrity and transforming raw inputs into trusted datasets
- Self-starter with demonstrated ability to learn new technologies quickly
- Experience with generative AI tools (e.g., AWS Bedrock, LangChain) is a plus
- Experience with testing frameworks (e.g., pytest) is a plus
Responsibilities
- Understand the data and the domain
- Partner with client delivery and customers to translate domain knowledge into data infrastructure requirements
- Explore raw customer data and profile files, columns, and statistical characteristics
- Design and implement end-to-end AI-enhanced ETL/ELT pipelines
- Coordinate with stakeholders and customers to resolve missing information and discrepancies
- Run quality control on data and data products through automated tests and targeted manual review
- Document assumptions and decisions to maintain traceability
- Build AI into data products including automated insights, anomaly detection, and AI-assisted data quality checks
- Evaluate AI/ML outputs and design human-in-the-loop verification
- Deliver interactive data analysis, data quality reports, statistical analyses, and visualizations
- Contribute improvements to internal tools for data cleaning and data quality assessment
Benefits
- 401(k) plan with up to 5% employer contribution
- Fully funded health benefits including vision and dental from day one for whole family
- Up to 24 weeks paid parental leave, 4-week paid ramp-back, and a $10,000 family forming benefit
- Flexible vacation policy with no set annual limit, Summer Fridays, extended December holiday period
- Flexible work options with access to offices and ability to work remotely as needed
- Opportunity to work remotely from eligible locations for up to two months per year
- Individualized mentoring, growth opportunities, and access to learning programs
- Dedicated wellness advisor
- Transit benefits and in-person events for team connection
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →
Текст вакансии взят без изменений