Current jobs related to Engineer, IT Cloud Site Reliability - Brentwood - Tractor Supply Company


  • Brentwood, California, United States Tractor Supply Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineering Manager to join our team at Tractor Supply Company. As a Site Reliability Engineering Manager, you will be responsible for managing and overseeing the engineering teams supporting a large-scale distributed application portfolio across on-prem and Cloud environments. The ideal candidate...


  • Brentwood, Tennessee, United States Jobot Full time

    Lead Site Reliability Engineer. Hiring a Lead Site Reliability Engineer. About the Role: We are seeking a seasoned Permanent Lead Site Reliability Engineer to join our dynamic team in the finance industry. The ideal candidate will be responsible for ensuring the reliability and robustness of our financial systems and services. Key Responsibilities: Design, build, and...


  • Brentwood, Tennessee, United States Tractor Supply Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineering Manager to join our team at Tractor Supply Company. As a key member of our engineering team, you will be responsible for managing and overseeing the engineering teams supporting our large-scale distributed application portfolio across on-prem and Cloud environments. Key...


  • Brentwood, Tennessee, United States Tractor Supply Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineering Manager to join our team at Tractor Supply Company. As a key member of our IT organization, you will be responsible for managing and overseeing the engineering teams supporting our large-scale distributed application portfolio across on-prem and Cloud environments. Key...


  • Brentwood, Tennessee, United States Jobot Full time

    Job Title: Lead Site Reliability Engineer. Hiring a Lead Site Reliability Engineer is a key priority for Jobot, a dynamic and innovative company in the finance industry. We are seeking a seasoned professional to join our team and ensure the reliability and robustness of our financial systems and services. About the Role: The ideal candidate will work closely...


  • Brentwood, United States Jobot Full time

    Dice is the leading career destination for tech experts at every stage of their careers. Our client, Jobot, is seeking the following. Apply via Dice today! Hiring a Lead Site Reliability Engineer. This Jobot Job is hosted by: Zach Maxey. Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume. Salary: $130,000 - $180,000...


  • Brentwood, Tennessee, United States Tractor Supply Full time

    Job Summary: As a Lead Site Reliability Engineer at Tractor Supply Company, you will play a vital role in implementing modern Engineering and DevOps techniques to increase efficiency, eliminate downtime, optimize cost, and maintain performance at scale. Key Responsibilities: Lead end-to-end availability, security, and performance of mission-critical applications...


  • Brentwood, Tennessee, United States Tractor Supply Full time

    Job Summary: As a Senior Manager of Site Reliability and DevOps Engineering at Tractor Supply Company, you will oversee the engineering teams supporting a large-scale distributed application portfolio across on-prem and Cloud environments. Key Responsibilities: Manage end-to-end availability and reliability of API and data integration services, systems,...


  • Brentwood, Tennessee, United States Tractor Supply Full time

    Job Summary: As a Lead Site Reliability Engineer at Tractor Supply Company, you will play a vital role in implementing modern engineering and DevOps techniques to ensure the reliability and performance of our large-scale distributed application portfolio. You will provide hands-on technical expertise to design, deploy, secure, and optimize cloud services,...


  • Brentwood, Tennessee, United States Tractor Supply Full time

    Job Summary: As a Site Reliability Manager at Tractor Supply Company, you will oversee the engineering teams supporting a large-scale distributed application portfolio across on-prem and Cloud environments. Your focus will be on increasing efficiency, eliminating downtime, optimizing cost, and managing performance at scale while providing leadership in cloud...


  • Brentwood, Tennessee, United States Tractor Supply Full time

    Job Summary: As a Lead Site Reliability Engineer at Tractor Supply Company, you will play a vital role in implementing modern Engineering and DevOps techniques to ensure the reliability and performance of our large-scale distributed application portfolio. You will provide hands-on technical expertise to design, deploy, secure, and optimize cloud services,...


  • Brentwood, United States Tractor Supply Company Full time

    Overall Job Summary: As a Lead Site Reliability Engineer, you will play a vital role in implementing modern Engineering and DevOps techniques, operating a large-scale distributed application portfolio across on-premises and cloud to increase efficiency, eliminate downtime, optimize cost, and maintain performance at scale. You will provide hands-on technical...


  • Brentwood, Tennessee, United States Broadridge Financial Solutions, Inc. Full time

    Job Title: Database Site Reliability Engineer. Broadridge Financial Solutions, Inc. is seeking a highly skilled Database Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for designing, building, and maintaining our database infrastructure to support our organization's data-driven...

  • Cloud Architect

    3 days ago


    Brentwood, California, United States Tractor Supply Full time

    Job Summary: The Cloud Architect will be responsible for designing and implementing cloud-based infrastructure solutions for Tractor Supply Company. This includes leading the development of cloud architecture, ensuring scalability and reliability, and collaborating with cross-functional teams to deliver high-quality solutions. Key Responsibilities: Design and...

  • Cloud Architect

    1 week ago


    Brentwood, Tennessee, United States Tractor Supply Full time

    Job Summary: The Cloud Architect will be responsible for designing and implementing cloud-based solutions to support the growth and success of Tractor Supply Company. This role will involve leading the development of cloud infrastructure, ensuring scalability, security, and reliability. Key Responsibilities: Design and implement cloud-based solutions to support...

  • Site Civil Engineer

    2 weeks ago


    Brentwood, Tennessee, United States Fives Full time

    About Fives Group: Fives Group is a leading industrial engineering company that designs and supplies machines, process equipment, and production lines for various industries. With a rich history dating back to 1812, Fives has established itself as a trusted partner for global industrial groups. Job Summary: We are seeking a highly skilled Site Civil Engineer to...


  • Brentwood, Tennessee, United States Cognizant North America Full time

    About Cognizant's Digital Engineering Practice: Cognizant Digital Engineering is a team of experts who build higher quality software faster by working together in a collaborative environment. Our teams are comprised of Product Managers, Architects, Full-Stack Developers, UI/UX designers, and Big Data analysts who share a common goal of delivering innovative...


  • Brentwood, Tennessee, United States Jobot Full time

    About the Job: We are seeking a highly skilled Civil Project Engineer to join our team at Jobot. As a key member of our engineering team, you will be responsible for designing and developing civil site plans and construction documents for land development projects. Key Responsibilities: Prepare civil site designs and construction documents using AutoCAD Civil...

Engineer, IT Cloud Site Reliability

2 months ago


Brentwood, United States Tractor Supply Company Full time
Overall Job Summary

A Cloud Site Reliability Engineer is a multifaceted role that combines elements of software engineering, system administration, and IT operations. Cloud SREs are responsible for ensuring the reliability, performance, and scalability of systems by focusing on system design, automation, monitoring, incident management, performance tuning, collaboration, and security. Their efforts directly impact the stability and efficiency of critical systems, enabling organizations to deliver reliable and efficient services at scale. This role requires a blend of technical expertise, problem-solving skills, and effective communication, making it essential for the success of modern, complex infrastructures.

Essential Duties and Responsibilities

Vendor Management: Strong negotiation skills, the ability to build better vendor relationships, network effectively, manage multiple vendors, identify financial risks, and evaluate new vendors. Industry awareness, strong people skills, and the ability to make effective decisions. Effective management by monitoring performance, managing risks, tracking key performance indicators, and ensuring compliance with regulations.

Coordinating Team Efforts: Coordinate efforts with teams located onsite, offshore, nearshore, and across multiple vendors, providing clear direction, setting expectations, and motivating team members to achieve common goals. Ensure that tasks are assigned, schedules are aligned, and resources are allocated effectively across teams and vendors. Establish regular communication channels and protocols to ensure that information is shared, feedback is provided, and issues are addressed in a timely manner. Understand and respect cultural differences and work styles of team members and vendors from different regions.

System Design and Architecture: Collaborate with software engineers to identify and mitigate risks to system availability and reliability.

Automation and Tooling: Develop and maintain automation tools to streamline operations and reduce manual interventions.

Monitoring and Incident Management: Help improve monitoring and alerting systems. Respond to incidents, perform root cause analysis, and implement permanent fixes to prevent recurrence, maintaining detailed documentation.

Performance and Scalability: Conduct performance tuning, optimization, and capacity management of systems to handle increasing loads and demand.

Collaboration and Communication: Communicate effectively with stakeholders about system performance, incidents, and improvements. Foster a culture of reliability and continuous improvement across the organization.

Security and Compliance: Ensure that systems and infrastructure comply with security best practices and regulatory requirements.
Required Qualifications

Experience: 4+ years of related work experience. Experience in the retail industry preferred.

Education: Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience. Any combination of education and experience will be considered.

Professional Certifications: None

High Demand IT Specialized skills:

Platform knowledge (UNIX, Linux, Windows). Public and private cloud technologies (AWS, Google Cloud, Azure) and containerization technologies (Docker, Kubernetes). Hyper-converged platforms (Nutanix, SimpliVity), VMware vSphere 6, AHV, Microsoft applications (Active Directory, Exchange, O365, and server OS), SaltStack.
Preferred knowledge, skills or abilities

  • Knowledge of ITIL Foundation concepts, practices, and procedures preferred.
  • Knowledge of continuous improvement concepts preferred.
  • Experience with programming and scripting languages (Python, Go, Java, Bash).
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK stack).
  • Excellent problem-solving skills and the ability to work under pressure.
  • Strong communication and collaboration skills, with a focus on teamwork and knowledge sharing.
  • Strong Enterprise Application Support experience.
  • Strong Process Management skills.
  • Ability to manage ITSM Tools and Enterprise Support tools.
  • Understanding of data integration concepts.
  • SDLC Waterfall and Agile knowledge preferred.
Working Conditions
Normal office working conditions

Physical Requirements

  • Sitting
  • Standing (not walking)
  • Walking
  • Lifting up to 20 pounds

Disclaimer

This job description represents an overview of the responsibilities for the above-referenced position. It is not intended to represent a comprehensive list of responsibilities. A team member should perform all duties as assigned by his/her supervisor.