Infrastructure Reliability Specialist

2 weeks ago


Bonney Lake, United States CharacterStrong Full time

Position:


The Infrastructure Reliability Specialist at CharacterStrong is responsible for the design and upkeep of scalable, dependable, and efficient systems that ensure optimal performance and uptime.

This role emphasizes the automation of operational tasks, deployment processes, and scaling methodologies, while actively seeking out and addressing infrastructure weaknesses and deployment challenges.

Specialists in this role take charge of the Developer Experience (DevEx), working closely with engineers to enhance service reliability and performance.

They are integral in executing incident response simulations, overseeing system performance, and establishing monitoring frameworks for the team.

These specialists advocate for high availability, performance optimization, and disaster recovery strategies, while promoting system enhancements and best practices.

A Bachelor's degree in Computer Science or a related discipline, along with 3+ years of experience in a similar role, proficiency in AWS services, container orchestration tools, and strong analytical skills are essential.


CharacterStrong's Background & Mission:


CharacterStrong is a dynamic tech education organization that develops PreK-12 digital curricula focused on social-emotional learning and provides professional development resources to assist schools in their implementation efforts.

The team consists of approximately 60 full-time staff members, several part-time employees, and over 30 trainers collaborating to realize this mission.

Our goal is to foster a more compassionate world by equipping educators with the necessary tools to teach critical social, emotional, and character skills that nurture a more empathetic, connected, and generous society.

At CharacterStrong, you will have the chance to positively influence education both domestically and globally.

CharacterStrong team members contribute their creativity, commitment to excellence, and empathy to develop transformative curricula and training for educators.

CharacterStrong's Company Values:


We Practice Kindness - Embracing inclusion, care, and empathy in our interactions, balancing honesty with compassion during difficult conversations, and fostering kindness towards oneself.

We Produce Excellence - Delivering timely, high-quality results and consistently asking, "How can we enhance this by 1%?"

We Take Ownership - Proactively driving initiatives forward, demonstrating accountability when challenges arise, and actively closing identified gaps.

We Problem Solve - Recognizing issues, analyzing them for understanding, and taking action to implement optimal solutions.

Infrastructure Reliability Specialist Responsibilities:


Design and maintain scalable, dependable, and efficient systems to ensure optimal uptime and effective issue resolution.

Automate operational tasks, deployment processes, and scaling methodologies.
Identify, analyze, and rectify infrastructure vulnerabilities and application deployment challenges.
Take charge of the

Developer Experience:
Identify and implement strategies to enhance overall DevEx
Collaborate with engineers to improve service reliability and performance.
Conduct incident response simulations and post-incident analyses.
Monitor system performance and troubleshoot issues.

Create monitoring frameworks for the team to easily access potential issues across various areas of our systems and services.

Ensure high availability and acceptable performance levels of mission-critical resources.
Develop disaster recovery strategies and engage in service capacity planning.
Implement automation tools for system health and performance monitoring.
Update and maintain documentation for configuration and troubleshooting procedures.
Advocate for system enhancements and best practices.

Required Qualifications:
Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.
3+ years of experience in an Infrastructure Reliability Specialist or similar capacity.
Experience with AWS cloud services, DynamoDB, and Elasticsearch. Familiarity with Pulumi and Playwright is a plus.
Knowledge of container orchestration tools (Kubernetes, Docker with AWS Lambda and Serverless functions).
Understanding of network protocols and theory.
Experience with monitoring and alerting systems.
Substantial knowledge and experience with common web development skills, including Rest APIs, React, HTML, CSS, Typescript or JavaScript (nodeJS) experience, Serverless Technology, Tools, Debugging, Basic Graphic Design, Back end and Databases, Hosting Services, Libraries and frameworks, SEO, CI Pipelines such as Github Actions.
Proven ability to implement innovative design theories and coding best practices.
Strong analytical and communication skills.
Experience with configuration management tools.
Ability to work collaboratively within a team environment.
Knowledge of database management and NoSQL.
Commitment to ongoing learning and improvement.

Other Duties:
Performs additional responsibilities as assigned.
Salary & Benefits

Starting Salary $100,000 - $125,000, based on experience
Remote work flexibility
Annual bonus contingent on company performance
Provision of a new laptop, AirPods, and other necessary equipment
Annual Individual Budget for Professional Development of $1,000
401k Plan after 12 months of Employment
Medical, Dental, and Vision Insurance
Paid Parental Leave after 12 months of employment for eligible employees
10 Paid Vacation Days, 8 Paid Sick Days, 14 Paid Company Holidays


CharacterStrong values diversity and the unique ways team members connect with our student and educator populations as an asset.

Our aim is to ensure our team reflects the diverse student population we serve.
CharacterStrong is an equal-opportunity employer, committed to fair treatment of all employees based on merit.

In accordance with applicable law, race, color, creed, ancestry, national origin, citizenship, sex or gender (including pregnancy, childbirth, and pregnancy-related conditions, sexual orientation, gender identity or expression, and transgender status), marital status, religion, age, disability, genetic information (including testing and characteristics), service in the military, or any other characteristic protected by applicable federal, state, or local law does not affect employment opportunities or practices such as hiring, promotion, development opportunities, pay, or benefits.

CharacterStrong adheres to all applicable federal, state, and local labor laws.

  • Lake Zurich, Illinois, United States Dovenmuehle Full time

    Position Type: Full-time; ExemptDepartment: PC/LANDovenmuehle Mortgage, Inc. is a premier mortgage subservicing organization in the United States, serving numerous financial institution clients across the nation.OverviewWe are in search of a proficient Site Reliability Engineer (SRE) with a focus on DevOps to bolster the dependability and efficiency of our...


  • Lake Mary, United States CentralSquare Technologies Full time

    Job DescriptionJob DescriptionWhat We’re About At CentralSquare, you’ll get the opportunity to work in a collaborative environment within a company that builds complex web-based enterprise applications for our Public Servants across North America. Looking to grow your career? That’s great! We believe in growing and cultivating careers here. There is...


  • Salt Lake, Utah, United States Danone Full time

    Overview and Position SummaryInfrastructure Systems SpecialistAt Danone, we are fostering a community of dedicated individuals committed to making a positive impact on the world we share. Our mission is to create a sustainable future through innovative practices and high-quality products.Joining our organization means becoming part of a vibrant team where...


  • Salt Lake, Utah, United States Comprehensive Recruiting Full time

    Infrastructure Project Specialist - Competitive Salary RangeElevate Your Career(Salary commensurate with experience)Enhance your professional journey with opportunities for advancement, job security, and a promotion that includes an attractive salary and a comprehensive benefits package.Exciting Career Opportunity: Our client delivers services to various...


  • Silver Lake, Kansas, United States Walmart Full time

    About the RoleWe are seeking a highly skilled Technical Support Specialist to join our team at Walmart Global Tech. As a key member of our infrastructure team, you will be responsible for providing technical support and expertise to ensure the smooth operation of our systems and infrastructure.Key ResponsibilitiesProvide technical support and troubleshooting...


  • Lake Crystal, Minnesota, United States League of Minnesota Cities Full time

    Job Summary:We are seeking a skilled Maintenance Technician to join our team at the League of Minnesota Cities. As a key member of our infrastructure team, you will be responsible for operating a wide range of equipment to construct, repair, and maintain city streets, utilities, parks, buildings, grounds, and other infrastructure.Key Responsibilities:Perform...


  • Liberty Lake, United States Alarm Full time

    Job OverviewPosition: Cloud Operations EngineerAbout UsAlarm is a leading provider of innovative security solutions, dedicated to enhancing safety and operational efficiency for businesses across various sectors. With a commitment to excellence, we deliver state-of-the-art technology and unparalleled customer support, ensuring our clients can protect their...


  • Lake Wildwood, United States SilverTech Full time

    About SilverTechSilverTech is a company that specializes in developing and supporting innovative solutions for the healthcare industry. We are part of the ResMed Group, a leading international company that provides medical devices and software solutions.Our MissionWe aim to improve the operation of our infrastructure and provide our customers with a seamless...


  • Salt Lake, Utah, United States Salt Lake City Full time

    Position Title:Water Infrastructure Maintenance Specialist IIJob Overview:Under general oversight, this role involves the operation of designated machinery and vehicles, including dump trucks, front-end loaders, compressors, and similar equipment, while executing specialized and routine maintenance tasks related to water systems. The position may also...


  • Salt Lake, Utah, United States FEDERAL RESERVE OF SAN FRANCISCO Full time

    Position Overview:The Federal Reserve Bank of San Francisco is dedicated to enhancing the nation's monetary, financial, and payment systems to foster a more robust economy. Our inclusive team is committed to creating an economy that benefits all. We are seeking a Senior Cloud Reliability Engineer to join our Data and Analytics Services Team, where you will...


  • Bonney Lake, United States Windermere Real Estate Part time

    Job DescriptionJob DescriptionWelcome to Windermere Real Estate - where being a Real Estate Specialist is more than just a job, it's a true calling. We believe in going above and beyond for our clients and communities, elevating and humanizing real estate every single day. We are the relationship heroes that make dreams come true.Are you ready to join...


  • Salt Lake, Utah, United States CIRCLE Full time

    About CircleCircle is a leading financial technology company that is revolutionizing the way value is transferred and stored. Our mission is to create an inclusive financial future, with transparency at our core.Job SummaryWe are seeking a highly skilled Senior Software Engineer to join our team and contribute to the development of our in-house blockchain...


  • Lake Mary, Florida, United States CentralSquare Technologies Full time

    Job SummaryCentralSquare Technologies is seeking a highly skilled IT Infrastructure Specialist to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and managing the infrastructure that supports our enterprise applications and systems.Key ResponsibilitiesDesign and Implementation: Design and...


  • Salt Lake, Utah, United States Rio Tinto Full time

    Job OverviewPosition: Reliability Maintenance SpecialistJoin a company that prioritizes the health and safety of its workforce above all else.Contribute to one of the largest copper mining operations globally.Advance your career with numerous opportunities for growth.Role SummaryWe are in search of a Reliability Maintenance Specialist to aid in achieving...


  • Leisure Village West-Pine Lake Park, New Jersey, United States Aurora Technologies Full time

    Job SummaryAurora Technologies is seeking a highly skilled IT Infrastructure Specialist to join our team. As a key member of our IT department, you will be responsible for the administration and maintenance of our server landscapes, ensuring seamless operation and optimal performance.Key ResponsibilitiesSupport the IT administration of our customers,...


  • Salt Lake, Utah, United States Goldman Sachs Full time

    About This RoleAt Goldman Sachs, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the availability and reliability of our firm's most critical platform services.Key ResponsibilitiesBalance feature development velocity and reliability with well-defined Service Level...


  • Salt Lake City, United States Channel Personnel Services Full time $130,000 - $150,000

    Job DescriptionJob DescriptionThe overall objective of the Reliability Superintendent is to ensure the safe and reliable operation of production facilities at the lowest life cycle cost. The Reliability Center Superintendent will be responsible for the assignment and supervision of work by all reliability specialists in the region and will ensure safe work,...


  • Salt Lake City, United States Channel Personnel Services Full time $130,000 - $150,000

    Job DescriptionJob DescriptionThe overall objective of the Reliability Superintendent is to ensure the safe and reliable operation of production facilities at the lowest life cycle cost. The Reliability Center Superintendent will be responsible for the assignment and supervision of work by all reliability specialists in the region and will ensure safe work,...


  • Salt Lake City, United States Channel Personnel Services Full time $130,000 - $150,000

    Job DescriptionJob DescriptionThe overall objective of the Reliability Superintendent is to ensure the safe and reliable operation of production facilities at the lowest life cycle cost. The Reliability Center Superintendent will be responsible for the assignment and supervision of work by all reliability specialists in the region and will ensure safe work,...


  • Moses Lake, United States Sila Nanotechnologies Full time

    Job OverviewAbout Sila NanotechnologiesWe are a pioneering company in the battery materials sector, dedicated to facilitating the global shift towards sustainable energy solutions. Our objective is to revolutionize lithium-ion batteries from the core, engineering and producing innovative materials that enhance energy density while minimizing size and weight....