Sr. Site Reliability Engineer

3 weeks ago


Seattle, United States Comtech LLC Full time

Comtech is a woman-owned small business founded in 1998 and headquartered in Reston, VA. We offer IT solutions across the disciplines of program/project management, applications development, infrastructure, Cyber security, and enterprise content/data management services. We have developed our methodologies and processes based on the IT Infrastructure Library (ITIL) v.3 Framework across enterprise infrastructure operations. These methodologies and processes are reinforced through our organization’s externally accredited certifications, which include ISO 9001:2008 Quality Management System (QMS), ISO/IEC 20000-1:2011 IT Service Management Systems (SMS, corporate ITIL certification), ISO 27001:2005 Information Security Management System (ISMS), and CMMI-DEV Level 3"Job DescriptionSr. Site Reliability EngineerLocation – Seattle, WADuration – 12 monthsInterview – in-person if local or Phone + skypeMinimum Requirement:These are top 4 criteria important for SREs1. C# /Java/ coding project exp 30% of the work involves going into API and handling code /bugs /error etc.2. Platform experience = Chef, Puppet, Azure, Ansible. Good hands on in implementing and executing at least 2 of these. This requires language on Python, PowerShell, Ruby, Perl. At least experience in 2 of these languages is must – Requires 30% of work3. Infrastructure experience – Windows/Linux/Unix server management. Network and security protocols. Load balancing and system engineering support – 30% of work4. Agile and project methodologies knowledge – 10% workJob Description:As a Sr. Site Reliability Engineer – you will be responsible for the day-to-day maintenance and administration of Internet-based enterprise systems. On an on-going basis, this position will identify root causes of operational issues in order to resolve them. As required, this position will help develop tools and scripts to facilitate that maintenance and administration.This position will also work closely with other teams to document the enterprise infrastructure and monitoring systems. You will also be responsible for planning and execution of small to large-scale projects within the Technology teams under the direction of the manager.This role requires your A-Game: deep technical proficiency in both enterprise-scale systems as well as next gen cloud native applications required. So if you believe, like we do, that a cup of coffee can change a life and change our world, come check us out and help us deliver that same amazing experience to our customers around the globe.Must Haves/Nice to Haves:• Experience working in a high capacity, highly scalable mission-critical web serving environment• Proven ability to participate with other functional teams in systems integration and design including writing operational specifications, test plans and requirements management with attention to detail• UNIX/LINUX and Windows and server experience, including expertise in system installation, configuration, administration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures• Web (IIS, Apache), .Net & Java application (Tomcat, Jboss, etc) server expertise including installation, administration, configuration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures• Experience in at least two relevant scripting or programming languages (Ruby, Perl, Python, Shell, PowerShell, etc.)• Experience with Configuration Management platforms (Chef, Ansible, CFEngine, Puppet, etc.)• Database Administration – setup, configuration and basic database troubleshooting skills• Understanding of internet standards such as HTTP, DNS, FTP, SSH, HTML, XML, JDBC, ODBC, SNMP and other protocols• Understanding of high availability hardware and database systems design and implementation including cluster management, redundancy and failover testing• Knowledge of storage systems (SAN, NAS, RAID Array, etc)• Experience hardening and maintaining secure systems (Safe Harbor or PCI experience a plus)• Network hardware architecting experience with load balancing equipment, switches, routers, and network troubleshooting• Ability to produce system documentation, including writing requirements, operational specifications, system architecture, test plans and as-built documentation, all with attention to detail• Experience working with ITIL and Service Management best practices is a plus.• Ability to build strong relationships and influence others across the organization• Demonstrated knowledge of agile project methodologies• 5+ years experience designing, supporting and deploying Internet-based products or services• 4+ years operating complex, large-scale Enterprise guest-facing Applications or web sitesBest Regards,Desk- 703 962 6656Additional Information**Please share me your updated word copy of Resume.*** I'll Appreciate, if you can refer someone who is looking for this position. #J-18808-Ljbffr



  • Seattle, United States F5 Full time

    At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.  Everything we do centers...


  • Seattle, United States Apple Full time

    Sr. Site Reliability Engineer (SRE) - iCloudSeattle, Washington, United StatesSoftware and ServicesPosted: Apr 30, 2025Weekly Hours: 40Role Number: 200593986The Apple Service Engineering - iCloud SRE team is looking for Site Reliability Engineers to build and run the services that hundreds of millions of customers use every day. This team provides systems...


  • Seattle, WA, United States Apple Full time

    Role Number: 200635067-3337 Summary The Apple Service Engineering - SRE team is looking for Site Reliability Engineers with experience in developing processes, tools, and automation for managing distributed systems in production environments. Our SRE team combines software and systems engineering and system administration practices to build and run...


  • Seattle, Washington, United States Coupang Full time $176,000 - $221,000 per year

    Job Overview:Site Reliability Engineers (SREs) at Coupang is a mission-critical role which combines software and system engineering to build, run and scale our complex, large-scale ecommerce systems. As part of the Site Reliability Engineering team, you will be responsible for ensuring all our customer facing services are healthy, monitored, automated, and...


  • Seattle, United States Redfin Full time

    Join to apply for the Site Reliability Engineer role at RedfinThis position is a hybrid role requiring employees to work from our headquarters location in Seattle, WA every Tuesday and Wednesday, and remote all other days.Job DescriptionRedfin is revolutionizing the $75 billion real‑estate industry. We use data, beautiful software, and innovative design...


  • Seattle, WA, United States Kaav Inc. Full time

    Who we are We are a yoga-inspired technical apparel company up to big things. The practice and philosophy of yoga informs our overall purpose to elevate the world through the power of practice. We are proud to be a growing global company with locations all around the world, from Vancouver to Shanghai, and places in between. We owe our success to our...


  • Seattle, United States ByteDance Full time

    Senior Site Reliability Engineer - Data Infrastructure Location: Seattle Team: Technology Employment Type: Regular Job Code: A32035 Responsibilities Team Introduction: Our Site Reliability Engineering (SRE) team blends software and systems engineering to build and operate large‑scale data infrastructure with high reliability and efficiency. We provide a...


  • Seattle, Washington, United States Apple Full time

    People at Apple don't just build products — they craft experiences our customers love and depend on. Apple Services Engineering (ASE) builds and supports the systems that make many of these daily experiences possible. If you've used Apple products, you've likely interacted with us. Apple Services Site Reliability Engineering (SRE) teams are responsible for...


  • Seattle, United States Qumulo Full time

    Join to apply for the Site Reliability Engineer role at QumuloJoin to apply for the Site Reliability Engineer role at QumuloAbout The CompanyQumulo is the unstructured data platform to store and manage exabyte-scale data anywhere – at the edge, in the core data center and in the cloud. With unstructured data growing in more locations faster than ever...


  • Seattle, United States MCG Health Full time

    Join to apply for the Senior Site Reliability Engineer role at MCG HealthJoin to apply for the Senior Site Reliability Engineer role at MCG HealthAt MCG, we lead the healthcare community to deliver patient-focused care. We have a mission-driven team of talented physicians and technical experts developing our evidence-based content and innovating our products...