Middle/Senior Data Engineer (AWS)

Posted 10 Days Ago
Hiring Remotely in Metropolitan Area Apt, ON
Remote
Mid level
Artificial Intelligence • Information Technology • Consulting
The Role
As a Data Engineer at Provectus, you will build and maintain data pipelines, collaborate with Data Scientists, and create data models. Your role includes working with cloud platforms and various data tools to manage large data sets and support Generative AI applications and internal projects.
Summary Generated by Built In

We are seeking a talented and experienced Data Engineer to join our team at Provectus. Working across our practices, which include Data, Machine Learning, DevOps, Application Development, and QA, you will collaborate with a multidisciplinary team of data engineers, machine learning engineers, and application developers. You will encounter numerous technical challenges and have the opportunity to contribute to Provectus’ open source projects, build internal solutions, and engage in R&D activities, all of which provide an excellent environment for professional growth.

Requirements:

  • Experience in data engineering;
  • Experience working with Cloud Solutions (preferably AWS, also GCP or Azure);
  • Experience with Cloud Data Platforms (e.g., Snowflake, Databricks);
  • Proficiency with Infrastructure as Code (IaC) technologies like Terraform or AWS CloudFormation;
  • Experience handling real-time and batch data flows and data warehousing with tools and technologies such as Airflow, Dagster, Kafka, Apache Druid, Spark, and dbt;
  • Proficiency in programming languages relevant to data engineering such as Python and SQL;
  • Experience in building scalable APIs;
  • Experience in building Generative AI Applications (e.g., chatbots, RAG systems);
  • Familiarity with Data Governance aspects like Quality, Discovery, Lineage, Security, Business Glossary, Modeling, Master Data, and Cost Optimization;
  • Advanced or Fluent English skills;
  • Strong problem-solving skills and the ability to work collaboratively in a fast-paced environment.

Nice to Have:

  • Relevant AWS, GCP, Azure, Databricks certifications;
  • Knowledge of BI Tools (Power BI, QuickSight, Looker, Tableau, etc.);
  • Experience in building Data Solutions in a Data Mesh architecture;
  • Familiarity with classical Machine Learning tasks and tools (e.g., OCR, AWS SageMaker, MLflow).

Responsibilities:

  • Collaborate closely with clients to deeply understand their existing IT environments, applications, business requirements, and digital transformation goals;
  • Collect and manage large volumes of varied data sets;
  • Work directly with Data Scientists and ML Engineers to create robust and resilient data pipelines that feed Data Products;
  • Define data models that integrate disparate data across the organization;
  • Design, implement, and maintain ETL/ELT data pipelines;
  • Perform data transformations using tools such as Spark, Trino, and AWS Athena to handle large volumes of data efficiently;
  • Develop, continuously test, and deploy Data API Products with Python and frameworks like Flask or FastAPI.

Top Skills

Python
SQL
The Company
HQ: Palo Alto, CA
572 Employees
On-site Workplace
Year Founded: 2010

What We Do

Provectus is an Artificial Intelligence consultancy and solutions provider, helping businesses achieve their objectives through AI.

We are recognized by industry think tanks as a leading provider of AI solutions in specific business domains, driven by sophisticated IT service management and tech innovation. Provectus is a value driver and a trusted partner for our clients and employees.

Provectus is an AWS Premier Consulting Partner with competencies in Data & Analytics, DevOps, and Machine Learning. We design and build AI solutions for industry-specific use cases, Data and Machine Learning foundation, Cloud transformation, and DevOps adoption.
