Reliability and Performance Engineer

6 days ago


Mountain View, California, United States BCforward Full time
Job Description

BCforward is currently seeking a highly motivated Site Reliability Engineer for an opportunity in a dynamic and innovative company.

Position Title: Site Reliability Engineer

Location: Remote (with occasional on-site visits)

Job Type: Contract (40 hours weekly), Hybrid

Pay Range: $95/hr - $97/hr

Please note that actual compensation may vary within this range due to factors such as location, experience, and job responsibilities.

Requirements:

  • Linux/Unix

Responsibilities:


Data Monitoring and Alerting: Design and implement data monitoring and alerting systems to ensure timely detection and response to issues.


Data Quality Assurance and Anomaly Detection: Develop and maintain data quality assurance processes to identify and address anomalies.


System Design and Implementation: Analyze and design solutions to remove bottlenecks and improve system performance.


Monitoring and Alerting: Implement monitoring and alerting systems to improve issue detection and response.


Technical Operations: Participate in technical operations and rotations in response to performance and reliability issues.


On-Call Rotations: Participate in on-call rotations, responsible for resolving or escalating incoming events.


Linux and Kubernetes Environment: Maintain and operate a Linux and Kubernetes environment.

Qualifications:


3+ years of experience working with Unix/Linux systems


Experience reading Python scripts for platform operations


Experience in networking technologies such as TCP/IP, BGP, DNS, etc. in a carrier-grade environment


Experience in developing and operating one or more of the following systems: OpenStack, Kubernetes, Nginx, IPVS, ELK stack, Hadoop, etc.


Bachelor's degree or above, majoring in Computer Science or related fields, with at least 2 years of related work experience

Benefits:

BCforward offers all eligible employees a comprehensive benefits package including, but not limited to, major medical, HSA, dental, vision, employer-provided group life, voluntary life insurance, short-term disability, long-term disability, and 401k.

About BCforward:

BCforward is a Black-owned firm providing unique solutions supporting value capture and digital product delivery needs for organizations around the world. Headquartered in Indianapolis, IN with an Offshore Development Center in Hyderabad, India, BCforward's 6,000 consultants support more than 225 clients globally.

BCforward champions the power of human potential to help companies transform, accelerate, and scale. Guided by our core values of People-Centric, Optimism, Excellence, Diversity, and Accountability, our professionals have helped our clients achieve their strategic goals for more than 25 years. Our strong culture and clear values have enabled BCforward to become a market leader and best-in-class place to work.

BCforward is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against based on disability.

To learn more about how BCforward collects and uses personal information as part of the recruiting process, view our Privacy Notice and CCPA Addendum. As part of the recruitment process, we may ask you to disclose and provide us with various categories of personal information, including identifiers, professional information, commercial information, education information, and other related information. BCforward will only use this information to complete the recruitment process.

This posting is not an offer of employment. All applicants applying for positions in the United States must be legally authorized to work in the United States. The submission of intentionally false or fraudulent information in response to this posting may render the applicant ineligible for the position. Any subsequent offer of employment will be considered employment at-will regardless of the anticipated assignment duration.



  • Mountain View, California, United States CUSHMAN Full time

    Job Title: Lead Reliability Engineer. Job Description Summary: The Lead Facilities Reliability Engineer will develop, implement and track facilities reliability and maintenance engineering programs at client site with a focus on performing facilities condition assessments and maintaining the facilities condition assessment database. Utilizing plant...


  • Mountain View, California, United States CENTRL Full time

    CENTRL is looking for a highly skilled and innovative Senior Site Reliability Engineer to take charge of our cloud and infrastructure operations. In this pivotal role, you will be responsible for the strategic oversight, planning, and implementation of our IT systems to guarantee optimal performance, scalability, and availability. Key Responsibilities: Analyze...


  • Mountain View, California, United States CENTRL Full time

    CENTRL is looking for a skilled and proactive Senior Site Reliability Engineer to enhance our cloud and infrastructure operations. In this pivotal role, you will be responsible for the strategic oversight, planning, and implementation of our IT systems to ensure optimal performance, scalability, and availability. Key Responsibilities: Analyze and gather metrics...


  • Mountain View, California, United States CENTRL Full time

    CENTRL is looking for a highly skilled and innovative professional to take on the role of Senior Site Reliability Engineer. In this pivotal position, you will be responsible for the strategic oversight, planning, and implementation of our cloud and infrastructure operations, ensuring optimal availability, scalability, and performance of our IT systems. Key...


  • Mountain View, California, United States Samsung Full time

    Embedded Site Reliability Engineer (Samsung Ads). Remote type: Hybrid. Locations: 645 Clyde Avenue, Mountain View, CA, USA; One Pennsylvania Plaza, 26th Floor, New York, NY, USA. Time type: Full time. Job requisition ID: R84565. Position Summary: In recent years, Samsung has transformed its hardware dominance into a dynamic ecosystem of engaging services across devices. Enter...


  • Mountain View, California, United States Samsung Electronics Full time

    Position Overview: Samsung has evolved from a hardware leader into a vibrant ecosystem of innovative services across devices. At the forefront of this transformation is Samsung Ads, a flourishing division poised for significant growth. Our Global Ads Product & Engineering team, with a robust presence across multiple countries, is integral to this advancement....


  • Mountain View, California, United States Google Inc. Full time

    Location: Mountain View, CA, USA. Level: Mid. As a pivotal member of the Hardware Testing Engineering team, you will play a crucial role in ensuring the reliability of advanced computing systems. Your expertise will be essential in the R&D lab, where you will design and implement testing protocols for prototypes, collaborating closely with design engineers to...


  • Mountain View, California, United States VentureDive Full time

    Job Brief: As a Data Platform Site Reliability Engineer, you will manage infrastructure and applications on cloud computing platforms to deliver data processing, governance, and storage. Our platform teams work with exabytes of data, terabytes of memory, and hundreds of thousands of jobs to enable predictable and performant data analytics. As an SRE, you'll...


  • Mountain View, California, United States Atlassian Full time

    About the Role: We're seeking a highly skilled Cloud Infrastructure Engineer to join our Site Reliability team at Atlassian. As a Site Reliability Engineer, you will play a critical role in ensuring the performance, reliability, and scalability of our cloud-based services. Key Responsibilities: Design and Implement Cloud Infrastructure: Collaborate with...


  • Mountain View, California, United States Insight Global Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team in the Bay Area. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design, implement, and maintain scalable and highly available cloud...


  • Mountain View, California, United States Groq Full time

    About the Role: We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Groq. As a key member of our infrastructure team, you will be responsible for ensuring the reliability, scalability, and performance of our tools and services for provisioning and managing the full lifecycle of Groq hardware and related support systems. Key...


  • Mountain View, California, United States Motion Recruitment Full time

    About the Role: Motion Recruitment is seeking a highly skilled Linux Systems Engineer to join our team. As a Site Reliability Engineer, you will be responsible for managing and maintaining large-scale Linux environments, implementing automation, and ensuring the reliability and scalability of our systems. Key Responsibilities: Design, implement, and maintain...


  • Mountain View, California, United States Atlassian Full time

    About the Role: We're seeking a highly skilled Site Reliability Engineer to join our team at Atlassian. As a Site Reliability Engineer, you will play a critical role in ensuring the performance, reliability, and scalability of our cloud-based services. Key Responsibilities: Improve Service Reliability: Actively work to improve the performance and reliability of...


  • Mountain View, California, United States Google Inc. Full time

    Location: Mountain View, CA, USA. Position Level: Mid. We are seeking an experienced professional with a strong background in hardware testing, capable of driving advancements, resolving challenges, and guiding junior team members. The ideal candidate will possess extensive knowledge and practical experience in the relevant field. Minimum...

  • Reliability Engineer

    2 weeks ago


    Mountain View, California, United States TikTok Full time

    TikTok stands as a premier platform for short-form mobile video, dedicated to fostering creativity and spreading joy. Our global presence spans numerous cities, reflecting our commitment to innovation and community. The Trust and Safety Engineering Team is rapidly expanding, focusing on the development of advanced machine learning models and systems aimed at...

  • Reliability Engineer

    2 weeks ago


    Mountain View, California, United States TikTok Full time

    TikTok is a premier platform for short-form mobile video, dedicated to fostering creativity and delivering joy. Our Trust and Safety engineering division is rapidly expanding, focusing on developing machine learning models and systems aimed at identifying and mitigating internet abuse and fraud across our platform. Our objective is to safeguard billions of...

  • Reliability Engineer

    2 weeks ago


    Mountain View, California, United States TikTok Full time

    TikTok is the premier platform for short-form mobile video, dedicated to fostering creativity and spreading joy. Our Trust and Safety engineering division is rapidly expanding, focusing on the development of machine learning models and systems designed to combat internet abuse and fraud. Our objective is to safeguard billions of users and content creators...


  • Mountain View, California, United States eTek IT Services, Inc. Full time

    Job Description: We are seeking a highly skilled Site Reliability Engineer - Cloud Infrastructure to join our team at eTek IT Services, Inc. Role: As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud infrastructure. Responsibilities: Data Monitoring and Alerting: Design and implement...


  • Mountain View, California, United States TikTok Full time

    TikTok stands as a premier platform for short-form mobile video, dedicated to fostering creativity and delivering joy. Our global presence spans numerous cities, enhancing our mission to protect users and content creators worldwide. The Trust and Safety Engineering Team is rapidly expanding, tasked with developing advanced machine learning models and systems...


  • Mountain View, California, United States TikTok Full time

    About the Role: TikTok is seeking a highly skilled Site Reliability Engineer to join our Trust and Safety engineering team. As a Site Reliability Engineer, you will be responsible for managing the day-to-day operations of our data services, including SLA management, system deployment, performance tuning, and troubleshooting. Key Responsibilities: Manage...