Senior MLOps Engineer
The ideal candidate will be responsible for maintaining product and industry knowledge. You will work in a team-oriented environment that accelerates operational efficiency.
Essential functions
- Design and implement GPU optimization strategies to maximize utilization and reduce latency for ML workloads
- Develop and maintain distributed training pipelines using Ray framework for large-scale model development
- Manage and optimize ML infrastructure across multi-cloud environments focusing on cost-efficiency and scalability
- Build monitoring and profiling tools for GPU performance analysis and resource allocation optimization
- Collaborate with data scientists and ML engineers to streamline model training, inference, and deployment processes
- Implement best practices for workload orchestration, fault tolerance, and auto-scaling in cloud environments
- Stay current with GPU architectures, ML frameworks, and cloud technologies to drive continuous infrastructure improvements
Qualifications
- 5+ years of ML infrastructure experience with 3+ years focused on GPU optimization
- Hands-on experience with AWS/EKS for ML workloads in production environments
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field
- Proven expertise with Ray framework (Ray Train, Ray Tune, Ray Serve) for distributed ML computing
- Strong CUDA programming skills for GPU performance optimization (cuDNN, TensorRT experience preferred)
- Proficiency with deep learning frameworks (TensorFlow, PyTorch, JAX) and performance tuning
- Experience with Kubernetes, Terraform, and infrastructure-as-code practices
- Strong analytical and problem-solving skills for complex performance bottlenecks
- Ability to collaborate effectively with data science, engineering, and DevOps teams
We offer
- Opportunity to work on cutting-edge projects
- Work with a highly motivated and dedicated team
- Competitive salary
- Flexible schedule
- Benefits package - medical insurance, vision, dental, etc.
- Corporate social events
- Professional development opportunities
- Well-equipped office
About us
Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.
Don’t see the right opportunity?
Contact us anyway and let’s talk! To apply, send your resume and cover letter to jobs@griddynamics.com
Grid Dynamics is an equal opportunity employer. We are committed to creating an inclusive environment for all employees during their employment and for all candidates during the application process.
All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on, age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. All employment is decided on the basis of qualifications, merit, and business need.