Skip to content

General Information

Job Title
Senior Staff, Site Reliability Engineer
Job ID
17592
Country
India
City
Bengaluru
Date Posted
21-May-2026
Job Category
Engineering
Job Subcategory
Engineering
Hire Type
Employee
Remote Eligible
No

Descriptions & Requirements

Job Description and Requirements

We Are

Synopsys is the leader in engineering solutions from silicon to systems, enabling customers to rapidly innovate AI-powered products. We deliver industry-leading silicon design, IP, simulation and analysis solutions, and design services. We partner closely with our customers across a wide range of industries to maximize their R&D capability and productivity, powering innovation today that ignites the ingenuity of tomorrow.

You Are

You have spent years keeping production systems alive at scale, and you have learned that the best incident is the one that never happens. Reliability is not a checkbox for you, it is a design principle. You know the difference between a system that alerts when it breaks and one that fixes itself before anyone notices, and you are the kind of engineer who builds the latter.

AI is not a buzzword in your world. You have used LLMs and agents to automate runbooks, resolve alerts, and eliminate toil in ways that actually work in production. You think in telemetry and error budgets. When something goes wrong, you do not just fix it, you instrument it, post-mortem it, and make sure it never happens that way again.

You are comfortable writing Python or TypeScript to solve real operational problems, not just gluing scripts together. Kubernetes, Terraform, and OpenTelemetry are tools you use daily, not things you put on a resume. On-call does not scare you because you have built systems that rarely page, and when they do, you know exactly where to look.

At Synopsys, you will work on Cloud-native SaaS products that serve a global customer base. The stack is modern, the problems are hard, and the team expects you to lead.

What You'll Be Doing

  • Own availability, latency, performance, and capacity for Synopsys Cloud-native SaaS products running on Azure AKS
  • Design and deploy AI agents using LLMs, Azure OpenAI, or LangChain to automate complex operational workflows and reduce manual toil
  • Build self-healing internal services instrumented with OpenTelemetry that detect and resolve incidents before they impact customers
  • Define and enforce SLIs, SLOs, and error budgets across the platform using the ELK stack and Azure Monitor
  • Lead post-incident reviews and drive continuous improvement initiatives that turn every outage into an automation opportunity
  • Evaluate and integrate new CNCF technologies and AI frameworks to keep the reliability stack current and competitive
  • Participate in a rotational on-call schedule, providing expert-level support to maintain high-availability commitments for global customers

The Impact You Will Have

  • Reduce mean time to detection and mean time to resolution across the SaaS platform through intelligent automation and observability
  • Enable the engineering organization to ship faster by building self-service tooling and reliable deployment pipelines
  • Decrease operational overhead by replacing manual runbooks with AI-driven resolution systems that scale across services
  • Improve customer trust and retention by maintaining uptime and performance SLAs in a high-traffic production environment
  • Set the technical standard for reliability engineering across Synopsys Cloud products, influencing architecture and design decisions
  • Mentor engineers across teams on observability, incident response, and cloud-native best practices
  • Drive cost efficiency by optimizing resource utilization and capacity planning across Azure infrastructure

What You'll Need

  • 7+ years in a dedicated SRE or DevOps role managing high-traffic SaaS environments in production
  • Hands-on experience building or implementing AI agents using LLMs, Azure OpenAI, or LangChain to solve production engineering problems
  • Deep architectural knowledge of Azure, specifically AKS, Blob Storage, Redis Cache, Azure AD, Azure Monitor, and Azure Automation
  • Proficiency in Python and TypeScript with a strong grasp of data structures, object-oriented programming, and writing clean, testable automation code
  • Expert-level command of Terraform, Kubernetes, Helm, Docker, and GitHub Actions in production Cloud-native environments
  • Deep experience with OpenTelemetry and the ELK stack, including instrumentation, distributed tracing, and building meaningful dashboards
  • Strong understanding of Linux internals, networking protocols, and both SQL and NoSQL database administration. Experience with AWS or GCP is a plus. PowerShell knowledge is a plus.

Who You Are

  • You can walk into a war room during an outage, diagnose the root cause in minutes, and coordinate a fix without creating more chaos
  • You write automation that other engineers actually use because it solves a real problem and does not require a PhD to operate
  • You push back when a proposed architecture will not scale or when observability is an afterthought, and you do it in a way that moves the conversation forward
  • You treat on-call as a design feedback loop, not a burden, and you use every page as a signal to improve the system
  • You can explain a complex tradeoff between availability and cost to a product manager in two sentences without losing the technical nuance
  • You are comfortable working across time zones with distributed teams and can communicate technical strategy clearly in written and spoken English

The Team You'll Be Part Of

You will join a rapidly growing Cloud development and SRE team focused on delivering state-of-the-art cloud solutions. The team is scaling to meet increasing demand and building the next generation of reliability and automation capabilities across Synopsys SaaS products.

Rewards and Benefits

We offer a comprehensive range of health, wellness, and financial benefits to cater to your needs. Our total rewards include both monetary and non-monetary offerings. Your recruiter will provide more details about the salary range and benefits during the hiring process

At Synopsys, we want talented people of every background to feel valued and supported to do their best work. Synopsys considers all applicants for employment without regard to race, color, religion, national origin, gender, sexual orientation, age, military veteran status, or disability.