Senior Site Reliability Engineer
We are looking for a Site Reliability Engineer (SRE) who will work with cross-functional teams to support and enhance the reliability, performance, and scalability of e-commerce platforms. This role involves active participation in on-call rotations, incident response, application-level troubleshooting, and close coordination with internal teams and third-party vendors.
Essential functions
- Provide 24/7 support in on-call rotations to respond to infrastructure and application incidents, primarily during assigned rotation weeks.
- Collaborate with DevOps, Development, and NOC teams to troubleshoot and resolve application and infrastructure issues.
- Use Splunk for monitoring, creating dashboards, and proactive troubleshooting of application and infrastructure logs.
- Act as a first responder to incidents, facilitating communication and root cause analysis, especially in collaboration with vendors like Akamai, Epsilon, and Fiserv.
- Coordinate incident response across internal teams and external vendors, ensuring efficient troubleshooting and resolution of critical issues.
- Document incident response processes and contribute to improving SRE best practices.
Qualifications
Minimum of 3+ years in SRE or similar roles.
Strong knowledge of AWS (ECS, Lambda, EC2, S3), Jenkins, and Terraform.
Proficiency in Splunk or similar monitoring tools.
Excellent communication and coordination skills, especially in incident response.
Strong debugging and troubleshooting skills in application and infrastructure-level incidents.
Experience with .NET and Java-based applications and familiarity with mobile app backends.
We offer
Opportunity to work on cutting-edge projects
Work with a highly motivated and dedicated team
Competitive salary
Flexible schedule
Benefits package - medical insurance
Corporate social events
Professional development opportunities
Well-equipped office
About us
Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.Apply to the position
Thank you!
You applied for the position Senior Site Reliability Engineer successfully. We will get back to you soon. Have a great day!
Something went wrong...
There are possible difficulties with connection or other issues. Please try to use another browser (it's recommended to use the latest version of Google Chrome browser). If the problem still persists, please send your application to cv@griddynamics.com
RetrySomething went wrong...
Please double-check the information filled in the form, and make sure to provide valid data.
RetryDon’t see the right opportunity?
Contact us anyway and let’s talk! To apply, send your resume and cover letter to jobs@griddynamics.com
Grid Dynamics is an equal opportunity employer. We are committed to creating an inclusive environment for all employees during their employment and for all candidates during the application process.
All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on, age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. All employment is decided on the basis of qualifications, merit, and business need.
Get in touch
Let's connect! How can we reach you?
Thank you!
It is very important to be in touch with you.
We will get back to you soon. Have a great day!
Something went wrong...
There are possible difficulties with connection or other issues.
Please try again after some time.