Site Reliability Engineer

1 day ago


San Jose, CA, United States McAfee Full time

Job Title:

Site Reliability Engineer

Role Overview:

We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team, with a specialized focus on supporting and administering the software development tools that power McAfee's CI/CD pipeline. This role is ideal for someone passionate about toolchain reliability, automation, and the integration of AI-driven solutions into modern DevOps workflows. You will play a key role in ensuring the performance, scalability, and resilience of our development lifecycle tools-while also helping us explore and implement AI-enhanced capabilities that streamline operations and accelerate innovation.

This is a Hybrid Position located in Texas. We are only considering candidates within a commutable distance to the Frisco office. You will be required to be onsite on an as-needed basis; when not working onsite, you will work from your home office.

About the Role:

  • Manage and administer tools critical to the software development lifecycle, including Jira, GitHub, Confluence, ClickUp, Artifactory, Figma, FullStory and others.

  • Optimize tool configurations, integrations, and workflows to enhance productivity and collaboration across teams.

  • Strong willingness to provide technical support and troubleshoot issues related to software development tools.

  • Work directly with software vendors to support applications, if necessary.

  • Be available to respond to Major Incidents; be comfortable working and collaborating in critical situations.

  • Collaborate with cross-functional teams to identify areas for improvement and implement innovative solutions.

  • Drive automation efforts to reduce manual interventions and enhance system resilience.

  • Stay on top of trends and emerging technologies within the Software Development Tool space.

  • Experience with AI integrations, strong familiarity with latest developments and tools in AI space.

  • Lead or participate in planning, testing and execution of large-scale data migration processes required due to changes or upgrades in toolsets.

  • Effectively collaborate with consultants to streamline migration processes.

  • Construct communications to effectively communicate with end users.

  • Ensure the highest levels of system and infrastructure availability by proactively identifying and mitigating potential risks.

  • Support system scaling efforts to accommodate increasing demands while maintaining optimal performance.

  • Ensure applications are compliant with security best practices.

  • Provide comprehensive production monitoring to identify, troubleshoot, and resolve issues and incidents.

About You:

  • 3+ years' experience in site reliability engineering, DevOps, or a related role.

  • High-level competency in Git and experience administering GitHub and Jira/Confluence (preference for expert-level Jira skills).

  • Strong preference for experience with GitHub Actions, but experience with other CI/CD tools like Jenkins, TeamCity, etc., is acceptable.

  • Desired experience administering tools such as Artifactory, ClickUp, Figma, FullStory, Anaconda, BlackDuck, and Figma.

  • Strong knowledge of system administration, performance monitoring, and troubleshooting.

  • Experience working in cybersecurity environments, ensuring secure system design, monitoring, and compliance with security best practices.

  • Proficiency in scripting and automation tools (e.g., Python, Bash, Terraform.)

  • Hands-on experience with cloud platforms (e.g., AWS, Azure, GCP).

  • Familiarity with CI/CD pipelines and version control systems.

  • Willingness to complete support tickets and provide excellent technical support to ensure smooth operations.

  • Strong communication skills, both written and verbal.

  • Desired skills/experiences include GitHub Copilot, Claude Code or other comparable GenAI tools, GitHub Advanced Security and JFrog Curation, familiarity with Linux/Unix systems, including command-line expertise and system troubleshooting, SSH, remote system management, Prometheus, Grafana, Splunk, PowerBI or comparable Business Intelligence/Analytics tools, Okta, IAM, Docker, Kubernetes.

  • Bachelor's degree in computer science, Engineering, or a related field is a plus.

#LI-Hybrid

Company Overview

McAfee is a leader in personal security for consumers. Focused on protecting people, not just devices, McAfee consumer solutions adapt to users' needs in an always online world, empowering them to live securely through integrated, intuitive solutions that protects their families and communities with the right security at the right moment.

Company Benefits and Perks:

We work hard to embrace diversity and inclusion and encourage everyone at McAfee to bring their authentic selves to work every day. We offer a variety of social programs, flexible work hours and family-friendly benefits to all of our employees.

  • Bonus Program

  • 401k Retirement Plan

  • Medical, Dental, Vision, Basic Life, Short Term Disability and Long-Term Disability Coverage

  • Paid Parental Leave

  • Support for Community Involvement

  • 14 Paid Company Holidays

  • Unlimited Paid Time Off for Exempt Employees

  • 96 Hours of Sick Time and 120 Hours of Vacation for Non-Exempt Employees Accrued Each Year

We're serious about our commitment to diversity which is why McAfee prohibits discrimination based on race, color, religion, gender, national origin, age, disability, veteran status, marital status, pregnancy, gender expression or identity, sexual orientation or any other legally protected status.

The starting pay range for this position is $81,120.00-$133,260.00. McAfee takes into consideration an individual's skillset, experience and location in making final salary determinations. For further details, please discuss with the Talent Acquisition Partner.

Please click here (https://www.mcafee.com/content/dam/consumer/en-us/docs/legal/mcafe-job-applicant-ccpa-notice.pdf) to view and download the Job Applicant Privacy Notice, which applies to all McAfee job applicants who are residents of the state of California.



  • San Jose, CA, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer Location: San Jose, CA (Remote) Type: Contract Job Description: 1) NVIDIA (DGX) - A100/ H100/ H200 2) Cisco UCS-C885A 3) Docker 4) NVIDIA certificated professionals preferred 5) Infrastructure knowledge on above skills 6) DevOps Automation: CI/CD systems (e.g., GitLab, GitHub Actions, Jenkins) Terraform, Ansible, Jenkins...


  • San Jose, CA, United States PayPal Full time

    The Company PayPal has been revolutionizing commerce globally for more than 25 years. Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empowers consumers and businesses in approximately 200 markets to join and thrive in the global economy. We operate a global, two-sided network at scale...


  • San Jose, CA, United States EPAM Systems Inc Full time

    At EPAM, we're not just building software - we're engineering excellence. We're looking for a Lead Site Reliability Engineer (SRE) with a passion for performance, precision, and proactive problem-solving to join a high-impact team supporting a leading sell-side trading environment. This role is ideal for someone who thrives in fast-paced financial systems,...


  • San Jose, CA, United States Hewlett Packard Enterprise Development LP Full time

    Principal Site Reliability Engineer This role has been designed as 'Hybrid' with an expectation that you will work on average 2 days per week from an HPE office. Who We Are: Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications...


  • San Jose, CA, United States PayPal Full time

    The Company PayPal has been revolutionizing commerce globally for more than 25 years. Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empowers consumers and businesses in approximately 200 markets to join and thrive in the global economy. We operate a global, two-sided network at scale...


  • San Jose, CA, United States PayPal Full time

    The Company PayPal has been revolutionizing commerce globally for more than 25 years. Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empowers consumers and businesses in approximately 200 markets to join and thrive in the global economy. We operate a global, two-sided network at scale...


  • San Jose, CA, United States PayPal Full time

    The Company PayPal has been revolutionizing commerce globally for more than 25 years. Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empowers consumers and businesses in approximately 200 markets to join and thrive in the global economy. We operate a global, two-sided network at scale...


  • San Jose, CA, United States PayPal Full time

    The Company PayPal has been revolutionizing commerce globally for more than 25 years. Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empowers consumers and businesses in approximately 200 markets to join and thrive in the global economy. We operate a global, two-sided network at scale...


  • San Francisco, CA, United States ConductorOne Full time

    ConductorOne is the first AI-native identity security platform that protects every identity: human, non-human, and AI. With powerful automation, platform-level AI, and out-of-the-box connectors, it centralizes access visibility, enforces fine-grained controls, enables just-in-time access, and automates user access reviews across all apps. It's easy to use,...


  • San Francisco, CA, United States ConductorOne Full time

    ConductorOne is the first AI-native identity security platform that protects every identity: human, non-human, and AI. With powerful automation, platform-level AI, and out-of-the-box connectors, it centralizes access visibility, enforces fine-grained controls, enables just-in-time access, and automates user access reviews across all apps. It's easy to use,...