CommIT

Senior Data Engineer

Reposted Yesterday

Be an Early Applicant

Remote

Hiring Remotely in Poland

Senior level

Remote

Hiring Remotely in Poland

Senior level

The Senior Data Engineer will design and implement batch and streaming data pipelines, evolve AWS data lake architecture, and mentor peers while ensuring optimal performance and cost efficiency.

The summary above was generated by AI

We are building a greenfield analytics platform supporting both batch and real-time data processing. We are looking for a Senior Data Engineer who can design, implement, and evolve scalable data systems in AWS.

This role combines hands-on development, architectural decision-making, and platform ownership.

Core Responsibilities:

Design and implement batch and streaming data pipelines using Apache Spark.
Build and evolve a scalable AWS-based data lake architecture.
Develop and maintain real-time data processing systems (event-driven pipelines).
Own performance tuning and cost optimization of Spark workloads.
Define best practices for data modeling, partitioning, and schema evolution.
Implement monitoring, observability, and data quality controls.
Contribute to infrastructure automation and CI/CD for data workflows.
Participate in architectural decisions and mentor other engineers.

Required Qualifications:

5+ years of experience in Data Engineering.
Strong hands-on experience with Apache Spark (including Structured Streaming).
Experience building both batch and streaming pipelines in production environments.
Proven experience designing AWS-based data lake architectures: S3, EMR, Glue, Athena.
Experience with event streaming platforms such as Apache Kafka or Amazon Kinesis.
Experience implementing lakehouse formats such as Delta Lake.
Strong understanding of partitioning strategies and schema evolution.
Experience using SparkUI and AWS CloudWatch for profiling and optimization.
Strong understanding of Spark performance tuning (shuffle, skew, memory, partitioning).
Proven track record of cost optimization in AWS environments.
Experience with Docker and CI/CD pipelines.
Experience with Infrastructure as Code: Terraform, AWS CDK.
Familiarity with monitoring and observability practices.
Experience in the Financial domain.
Experience running Spark workloads on Kubernetes.
Experience implementing data quality frameworks or metadata/lineage systems.
English - B2, Ukrainian- Native

Top Skills

Amazon Kinesis

Apache Kafka

Spark

AWS

Aws Cdk

Aws Cloudwatch

Ci/Cd

Delta Lake

Docker

Sparkui

Terraform

Similar Jobs

Globaldev Group

Senior Data Engineer

Yesterday

Remote

Senior level

Information Technology • Software

The Senior Data Engineer will design and optimize ETL/data pipelines, automate workflows, integrate data sources, and translate business needs into data solutions using Azure tools.

Top Skills: Azure Data FactoryAzure DatabricksPysparkSQL

Kpler

Senior Data Engineer

Yesterday

Remote or Hybrid

Senior level

Analytics

As a Senior Data Engineer, you will design, implement, and maintain real-time data pipelines and cloud-based infrastructure, mentor engineers, and collaborate across teams to enhance data products and services.

Top Skills: AWSAzureFlinkGCPJavaKafkaKinesisPulumiPythonRabbitMQSparkTerraform

TechBiz Global

Senior Data Engineer

2 Days Ago

Remote

Senior level

Agency

The Senior Data Engineer will build and optimize data pipelines, collaborate with engineering teams, and ensure data quality and scalability.

Top Skills: AWSAzureCi/CdDatabricksDbtDelta LakeGCPGitKafkaPythonSparkSQLTerraform

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.