Data Engineer Remote

Innodata

Tempo integralCanadaNot specified
BigQuery CloudStorage Dataflow Pub/Sub Looker SQL Python ETL DataLakes DataWarehouses GCP AWS DataGovernance Compliance MLOps

Company Overview

Innodata (Nasdaq: INOD) is a global data engineering company. We believe that data and Artificial Intelligence (AI) are inextricably linked. Our mission is to enable the responsible advancement of artificial intelligence by providing the data, evaluation frameworks, and human expertise required to build AI systems that can be trusted at scale. We provide a range of transferable solutions, platforms, and services for Generative AI / AI builders and adopters. In every relationship, we honor our 36+ year legacy delivering the highest quality data and outstanding outcomes for our customers.

Job Title

Data Engineer

Remote Location

Remote - Canada

Scope of the Role

We are seeking a Data Engineer to design and build enterprise data warehouses, data lakes, and pipelines that power data-driven decision-making for data center supply chain and real estate operations. This role is responsible for creating scalable, secure, and optimized ETL infrastructure on GCP/AWS, while enabling advanced AI/ML use cases such as RAG, copilots, and agentic AI for predictive analytics and workflow automation.

Job Responsibilities

  • Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI
  • Build ETL scripts using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems
  • Develop and optimize data pipelines for ingestion, transformation, and loading into enterprise data lakes and warehouses
  • Build and extend end-to-end data and BI solutions, spanning extraction, storage, transformation, and visualization layers
  • Partner with supply chain, real estate, and AI/ML teams to provide pipelines for AI solutions (e.g., RAG ingestion, Copilot integration, multi-agent workflows)
  • Ensure data governance, lineage, and compliance across supply chain datasets
  • Continuously optimize query performance, ETL processes, and pipeline reliability
  • Support advanced AI/ML use cases including RAG pipelines, copilots, and agentic AI for predictive analytics

Requirements

  • Advanced proficiency in SQL (complex queries, optimization) and Python (data engineering, scripting, APIs)
  • Experience building ETL/ELT pipelines operating on structured and unstructured data sources
  • Knowledge of enterprise data warehouse and data lake architectures
  • Exposure to data pipelines for AI/ML (vector DB ingestion, embeddings, RAG pipelines, copilots, agents)
  • Familiarity with supply chain or data center operations data is a strong plus
  • Strong hands-on expertise with GCP services: BigQuery, Dataflow, Pub/Sub, Cloud Storage, Looker/BI
  • Experience with cloud platforms (GCP preferred, AWS acceptable)
  • Understanding of data governance and compliance requirements

Preferred Qualifications

  • Experience with ML Engineering
  • Proficiency in data visualization tools (Looker, Tableau, Power BI)
  • Familiarity with MLOps practices
  • Knowledge of ERP, procurement, logistics, and facility management systems

Benefits

  • Competitive salary and compensation package
  • Remote work flexibility
  • Opportunity to work with cutting-edge AI/ML technologies
  • Collaborative and innovative work environment
  • Career growth opportunities in a globally recognized company

How to Apply

To apply for this position, please submit your application through our online portal. You will need to provide your resume/CV, LinkedIn profile, and complete the required information fields. Innodata is an equal opportunity employer.

Important Notice: Please be aware of recruitment scams involving individuals or organizations falsely claiming to represent employers. Innodata will never ask for payment, banking details, or sensitive personal information during the application process.