Descriptions & Requirements
We Are:
At Synopsys, we drive the innovations that shape the way we live and connect. Our technology is central to the Era of Pervasive Intelligence, from self-driving cars to learning machines. We lead in chip design, verification, and IP integration, empowering the creation of high-performance silicon chips and software content. Join us to transform the future through continuous technological innovation. This team handles design and vision and drives major initiatives for Data Center and cloud. We support a global network consisting of 100+ office locations, 12 data centers, 30 MW of colocation capacity, and a growing cloud presence.
You Are:
You are a motivated and passionate Site Reliability Engineer with a deep technical affinity for Data Centers. You thrive in environments where innovation and reliability intersect, bringing your proven track record of strategic decision-making to positively impact both team performance and organizational success. Your excellent communication skills enable you to seamlessly collaborate with cross-functional teams and effectively represent your work to senior leadership. You are comfortable navigating matrixed, international, and team-oriented settings, adept at managing priorities and aligning stakeholders toward common goals. As an experienced and process-oriented individual contributor, you are excited to join Synopsys’s Dallas Data Center and play a pivotal role in its ongoing transformation. Your commitment to operational excellence, proactive problem-solving, and continuous improvement is matched by your enthusiasm for new technologies and automation. You are driven by the challenge of optimizing complex systems, ensuring reliability, and mentoring others, while maintaining a forward-thinking approach to anticipate future challenges and opportunities. You embrace the responsibility of overseeing critical infrastructure, mentoring junior engineers, and driving adoption of innovative solutions. Your passion for hardware reliability, automation, and high-performance computing is evident in your ability to translate complex technical concepts into actionable insights for both technical and non-technical audiences. If you are self-motivated, independent, and committed to making a lasting impact, Synopsys welcomes you to help shape the future of our data center operations.
What You’ll Be Doing:
- Oversee all aspects of Data Center critical infrastructure, ensuring high-quality execution of all work performed within the DC space.
- Mentor junior engineers and contribute to the development of technical standards.
- Monitor support queues, track escalations, address tickets promptly, and communicate updates clearly.
- Lead the design and adoption of AI-enabled capabilities such as predictive analytics, anomaly detection, and automated incident response to proactively resolve complex issues.
- Develop, maintain, and govern technical documentation including network architectures, deployment playbooks, policies, standards, and guidelines.
- Coordinate with colocation and external vendors to complete engineering, maintenance, and outsourcing tasks in compliance with Data Center operation requirements.
- Ensure accurate asset tracking and lifecycle management using DCIM platforms, integrating with enterprise systems like ITSM, CMDB, and monitoring platforms.
- Leverage DCIM data models, APIs, and automation tools for predictive analytics and cost optimization.
- Drive adoption of advanced HPC technologies and collaborate with application teams to optimize workloads for efficiency.
- Oversee regular maintenance tasks to prevent problems and ensure equipment longevity.
- Assist with diagnosing, fixing, and installing server components, providing expertise in hardware reliability engineering.
- Implement automation strategies to streamline operations and reduce manual intervention.
- Track system performance and availability, implementing proactive measures to prevent downtime.
- Diagnose and resolve issues related to Linux OS, services, and system components.
- Participate in Data Center capacity management forums and on-call schedules.
The Impact You Will Have:
- Enhance the reliability and performance of Synopsys’s Data Center infrastructure, supporting critical engineering workloads.
- Reduce operational risks and downtime through proactive maintenance and advanced automation.
- Enable faster incident resolution and improved service quality by leading AI-driven monitoring and response initiatives.
- Drive efficiency and scalability for high-performance computing resources, empowering engineering teams to innovate.
- Facilitate seamless integration of Data Center systems with enterprise platforms, improving asset management and visibility.
- Mentor and develop junior engineers, fostering a culture of technical excellence and continuous improvement.
- Contribute to the strategic evolution of Synopsys’s Data Center operations, positioning the company at the forefront of technology.
- Support global expansion and cloud initiatives, ensuring robust infrastructure for future growth.
What You’ll Need:
- Bachelor’s degree (or equivalent experience) in Engineering, Materials Science, Physics, or a related field; Master’s preferred.
- 8+ years’ experience in hardware validation/reliability environments related to servers, storage, PCIe, and GPUs.
- Strong scripting and automation skills, particularly in Python and Ansible.
- Comprehensive understanding of power supply, memory, PCI, Ethernet, Linux operating systems, and enterprise ticketing systems like ServiceNow.
- Analytical and problem-solving expertise with customer issues, engineering challenges, and large-scale systems.
- Hands-on experience with theoretical and practical reliability concepts in high-tech electronic enterprise and consumer products.
- Command of statistical concepts/models/analysis as they relate to product reliability and life cycle analysis.
- Excellent verbal and written communication skills, able to translate complex technical concepts for diverse audiences.
- Self-motivated, independent, and committed to delivering results.
- Strong project management skills, able to balance multiple projects during development and production stages.
- Forward-thinking mindset to anticipate future challenges and proactively resolve issues.
Who You Are:
- Collaborative team player, comfortable in a matrixed, international environment.
- Process-oriented individual with a passion for continuous improvement.
- Effective communicator, able to engage with both technical and non-technical stakeholders.
- Mentor and motivator, eager to develop junior talent.
- Adaptable and resilient, thriving in fast-paced and evolving environments.
- Resourceful problem-solver with strategic vision.
- Proactive, detail-oriented, and committed to operational excellence.
The Team You’ll Be A Part Of:
The Data Center team leads initiatives that enable Synopsys to meet and exceed our objectives by focusing on providing a reliable compute and storage infrastructure to our engineering community. Our team collaborates with all business units to enable capacity and engage in opportunities that change our systems and methods and facilitate greater productivity, efficiency, and scale.
Rewards and Benefits:
We offer a comprehensive range of health, wellness, and financial benefits to cater to your needs. Our total rewards include both monetary and non-monetary offerings. Your recruiter will provide more details about the salary range and benefits during the hiring process.
At Synopsys, we want talented people of every background to feel valued and supported to do their best work. Synopsys considers all applicants for employment without regard to race, color, religion, national origin, gender, sexual orientation, age, military veteran status, or disability.
In addition to the base salary, this role may be eligible for an annual bonus, equity, and other discretionary bonuses. Synopsys offers comprehensive health, wellness, and financial benefits as part of a competitive total rewards package. The actual compensation offered will be based on a number of job-related factors, including location, skills, experience, and education. Your recruiter can share more specific details on the total rewards package upon request. The base salary range for this role is across the U.S.