Senior ML Engineer
We are seeking a highly skilled Senior ML Engineer with hands-on experience building intelligent agents and operationalizing cloud-native AI applications. In this high-impact role, you will take ownership of deploying AI-driven solutions using the Google Cloud Agent Development Kit (ADK) and Vertex AI. You will act as a key driver in designing, developing, and establishing robust AI Ops, automated compliance gates, and continuous monitoring workflows. This is a unique opportunity to work at the intersection of advanced cloud infrastructure and emerging Artificial Intelligence, shaping the AI agent ecosystem for the #1 home improvement retailer in the U.S. and a global e-commerce giant serving millions of daily users.
Essential functions
AI Agent Infrastructure: Design, implement, and maintain secure, scalable AI agent infrastructure on Google Cloud using Vertex AI and modern agent orchestration frameworks.
Operationalizing AI (AIOps): Build and enforce automated model deployment, continuous monitoring workflows, and model lifecycle management across the platform.
Identity & Access Governance: Develop and enforce secure IAM frameworks, including role-based access control (RBAC), least-privilege configurations, and cross-cloud alignment (Azure AD and Google Workspace).
Model Execution Boundaries: Review, stage, and secure custom MCP (Model Context Protocol) actions, ensuring safe, compliant, and well-bounded model execution.
CI/CD & Infrastructure Automation: Architect and manage automated compliance gates, CI/CD pipelines, and infrastructure deployment utilizing Terraform and GitHub Actions.
Observability & Monitoring: Implement robust real-time observability, policy enforcement, and alerting systems for containerized AI workflows to ensure maximum system reliability.
Qualifications
Cloud AI Mastery: Deep hands-on experience with cloud-based ML/AI platforms, specifically Google Cloud, Vertex AI, and AI Agent engines.
Agentic Frameworks: Proven track record of building and orchestrating intelligent agents using tools like the Google Cloud Agent Development Kit (ADK) and Model Context Protocol (MCP).
Enterprise Security & Architecture: Strong familiarity with enterprise-grade security concepts, including workload identity federation, short-lived tokens, VPC, and private endpoints.
Infrastructure as Code (IaC): Advanced skills in Terraform and CI/CD workflows (GitHub Actions) to ensure infrastructure is tightly controlled during agent deployment.
Coding & Scripting: Strong proficiency in Python for backend development, AI automation, and handling distributed message workflows via Google Pub/Sub.
Observability Tools: Practical experience setting up enterprise observability and monitoring stacks (e.g., OpenTelemetry, Grafana, LangSmith, PagerDuty).
Would be a plus
Containerization & Scale: Hands-on experience with serverless architectures and containerized environments, specifically Google Kubernetes Engine (GKE) and Docker.
Big Data Ecosystem: Familiarity with high-throughput data storage and search engines like Elasticsearch and BigTable.
Policy-as-Code: Understanding of real-time policy enforcement and frameworks like OPA (Open Policy Agent).
Retail or Supply Chain Domain: Experience engineering high-load, distributed systems within large-scale retail, supply chain logistics, or e-commerce platforms.
We offer
- Opportunity to work on bleeding-edge projects
- Work with a highly motivated and dedicated team
- Competitive salary
- Flexible schedule
- Benefits package - medical insurance, sports
- Corporate social events
- Professional development opportunities
- Well-equipped office
About us
Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.Apply to the position
Thank you!
You applied for the position Senior ML Engineer successfully. We will get back to you soon. Have a great day!
Something went wrong...
There are possible difficulties with connection or other issues. Please try to use another browser (it's recommended to use the latest version of Google Chrome browser). If the problem still persists, please send your application to cv@griddynamics.com
RetrySomething went wrong...
Please double-check the information filled in the form, and make sure to provide valid data.
RetryDon’t see the right opportunity?
Contact us anyway and let’s talk! To apply, send your resume and cover letter to jobs@griddynamics.com
Grid Dynamics is an equal opportunity employer. We are committed to creating an inclusive environment for all employees during their employment and for all candidates during the application process.
All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on, age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. All employment is decided on the basis of qualifications, merit, and business need.
