Site Reliability Engineer

4 days ago


Irving, Texas, United States | Creospan Inc. | Full time
Job Title: Site Reliability Engineer

At Creospan Inc., we are seeking a highly experienced Site Reliability Engineer to join our Application Production Support team. The ideal candidate will have a strong background in ensuring the reliability, performance, and scalability of complex systems.

Key Responsibilities:
  • Automation and Scripting:
    • Develop and maintain scripts to automate tasks and processes related to performance, scalability, and resilience.
    • Implement automation solutions to streamline operational workflows and reduce manual intervention.
  • Issue Triage and Resolution:
    • Triage and resolve issues affecting the platform's performance and stability.
    • Take ownership and accountability for the overall performance and reliability of the platform.
  • Tracking and Management:
    • Create and manage Jira tickets to track and resolve issues efficiently.
    • Ensure timely updates and closure of tickets to maintain workflow transparency.
  • System Monitoring and Health:
    • Monitor system health using SRE tools and proactively identify potential problems.
    • Utilize tools like Grafana, New Relic, and Kibana to monitor and analyze system performance metrics.
  • Collaboration and Data Analysis:
    • Collaborate with various cross-functional teams to gather necessary data and insights for troubleshooting and optimization.
    • Build data reports using Python to provide actionable insights to stakeholders.
  • Trend Monitoring and Operations:
    • Monitor trends in order processing and submission to ensure smooth operations.
    • Proactively address anomalies and issues to maintain high availability and reliability.
  • End-to-End Support:
    • Provide end-to-end support to the business, ensuring high availability and reliability of the platform.
    • Communicate effectively with cross-functional teams to ensure seamless support and operations.
Requirements:
  • Proficiency in using SRE tools such as Grafana, New Relic, and Kibana.
  • Strong scripting skills with experience in automation (e.g., Python, Shell scripting).
  • Experience in triaging and resolving performance and scalability issues for J2EE applications.
  • Ability to build and interpret data reports using Python.
  • Excellent problem-solving skills with a proactive approach to monitoring and maintenance.
  • Strong communication and collaboration skills to work effectively with cross-functional teams.
  • Preference for candidates with a background in DevOps practices and methodologies.
Qualifications:
  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 10+ years of overall IT experience, with at least 5 years specifically in a Site Reliability Engineer or similar role.
  • Proven experience in maintaining and supporting high-availability production environments.
  • Familiarity with J2EE application architecture and performance tuning.
  • Strong analytical skills and attention to detail.
Work Environment:
  • This is a hybrid position that requires regular in-office work.
  • May be required to support critical production incidents or project milestones.
  • Participation in an on-call rotation schedule to address critical production issues.


