Lead Big Data Developer
The primary business objective of this project is to modernize and scale the enterprise data platform by migrating the existing on-premises HDFS and Hive-based ecosystem to a cloud object storage solution.
This transition aims to reduce infrastructure maintenance overhead, improve scalability, and support advanced analytics through Apache Iceberg, which offers improved performance, versioned datasets, and native compatibility with modern data processing engines.
Essential functions
Lead and mentor a team of data engineers, providing technical direction and career guidance.
Define the target data platform architecture for migrating from on-prem HDFS/Hive to Cloud Object Storage (e.g., AWS S3, Azure Data Lake Storage, or GCP Cloud Storage).
Select and integrate cloud-based compute and query engines (e.g., Spark).
Lead the design of ingestion, transformation, and storage patterns optimized for scalability, cost-efficiency, and performance in the cloud.
Define security, encryption, and compliance controls for sensitive enterprise data in the cloud.
Develop and own the migration roadmap, including phased transition from on-prem to cloud while minimizing business disruption.
Oversee data migration strategies (bulk historical loads, incremental sync, and cutover).
Define and enforce coding standards, CI/CD pipelines, and automated testing for data pipelines.
Partner with Data Architects, Cloud Engineers, and Security teams to align platform design with enterprise standards
Qualifications
Proven experience leading data engineering teams, including distributed teams across multiple geographies and time zones.
Effective in managing cross-team collaboration with architects, product managers, and operations.
Knowledge of Scala and Python
Experience with Apache Spark (batch & streaming)
Deep knowledge of HDFS internals and migration strategies.
Experience with Apache Iceberg (or similar table formats like Delta Lake / Apache Hudi) for schema evolution, ACID transactions, and time travel.
Running Spark and/or Flink jobs on Kubernetes (e.g., Spark-on-K8s operator, Flink-on-K8s).
Experience with distributed blob storages like Ceph or AWS S3 and similar
Building ingestion, transformation, and enrichment pipelines for large-scale datasets.
Infrastructure-as-Code (Terraform, Helm) for provisioning data infrastructure.
Strong communication skills
Availability to join evening calls (till 21:00 EET)
Would be a plus
Experience with Apache Flink
We offer
- Opportunity to work on bleeding-edge projects
- Work with a highly motivated and dedicated team
- Competitive salary
- Flexible schedule
- Benefits package - medical insurance, sports
- Corporate social events
- Professional development opportunities
- Well-equipped office
About us
Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.Apply to the position
Thank you!
You applied for the position Lead Big Data Developer successfully. We will get back to you soon. Have a great day!
Something went wrong...
There are possible difficulties with connection or other issues. Please try to use another browser (it's recommended to use the latest version of Google Chrome browser). If the problem still persists, please send your application to cv@griddynamics.com
RetrySomething went wrong...
Please double-check the information filled in the form, and make sure to provide valid data.
RetryDon’t see the right opportunity?
Contact us anyway and let’s talk! To apply, send your resume and cover letter to jobs@griddynamics.com
Grid Dynamics is an equal opportunity employer. We are committed to creating an inclusive environment for all employees during their employment and for all candidates during the application process.
All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on, age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. All employment is decided on the basis of qualifications, merit, and business need.