Site Reliability Engineer

3 weeks ago


Town of Brookfield, United States Knak Digital Full time

About the Role

This Site Reliability Engineer III role is a senior, hands‑on position serving as the PostgreSQL subject‑matter expert across a modern, cloud‑native data platform. You’ll play a critical role in database architecture, performance, reliability, security, and observability, partnering closely with application, security, and platform teams. You’ll design scalable database solutions, define observability standards, and help evolve a multi‑database ecosystem spanning relational and non‑relational technologies, while mentoring others and raising the bar for operational excellence. This is a role for someone who enjoys deep technical ownership, thoughtful systems design, and influencing how teams work at scale.

What You’ll Do

Database Architecture & Reliability

  • Architect, implement, and maintain PostgreSQL databases on AWS
  • Optimize schemas, indexing, and queries for availability, scalability, and performance
  • Write and refactor high‑performance PL/pgSQL
  • Proactively diagnose performance issues and implement long‑term solutions

Observability & Performance

  • Design database observability strategies and reusable monitoring patterns
  • Build dashboards with real‑time visibility into health and performance
  • Implement anomaly detection, alerting, and cost/performance tuning
  • Recommend best practices for monitoring, tracing, and logging cloud databases

Cloud & Platform Integration

  • Leverage AWS services including Aurora Serverless, RDS, EC2, S3, and Lambda
  • Implement backup, recovery, and monitoring solutions using AWS‑native tools
  • Recommend storage patterns and policies to optimize cost and reliability
  • Ensure alignment with AWS security and operational best practices

Security & IAM

  • Implement and manage IAM policies for secure database access
  • Partner with security teams on audits, vulnerability remediation, and compliance
  • Ensure protection of sensitive and regulated data

Non‑Relational & Search Platforms

  • Design and optimize DynamoDB tables, partition strategies, and indexes
  • Deploy and manage OpenSearch clusters for search and analytics
  • Design and maintain DocumentDB instances
  • Integrate Redis/Valkey, Kafka, Amazon SQS, and related technologies

Data Integration & Governance

  • Support data integration and ETL workflows across platforms
  • Promote data governance, quality, and consistency best practices
  • Support infrastructure‑as‑code efforts (Terraform)

Collaboration & Mentorship

  • Serve as an escalation point for complex database issues
  • Perform root cause analysis and prevent recurrence
  • Mentor engineers and DBAs; develop training and documentation
  • Collaborate with development teams to improve application performance

About You

Core Qualifications

  • 5+ years of hands‑on PostgreSQL experience, including advanced tuning
  • Strong AWS experience (RDS, EC2, S3, Lambda, DynamoDB)
  • Deep understanding of database design, normalization, and indexing
  • Experience with observability tools (Datadog, AWS monitoring)
  • Proficiency in scripting (Python, Bash, or similar)
  • Experience with both relational and non‑relational data platforms
  • Strong troubleshooting, problem‑solving, and communication skills
  • Authorized to work in the United States

Nice to Have

  • AWS certifications (Solutions Architect, Database, or Security)
  • Experience refactoring or modernizing legacy systems
  • Familiarity with regulated environments (HIPAA, PCI)
  • Experience with Agile/Scrum delivery models
  • Comfort with Atlassian tools (Jira, Confluence, OpsGenie)
  • Exposure to AWS Glue, Neptune, or other purpose‑built databases

Benefits

  • Remote‑first U.S. role with occasional in‑person planning sessions
  • Entrepreneurial, collaborative team where individuals have real ownership and voice
  • Sustained growth with room to expand scope, skills, and career path
  • Strong investment in learning & development, including mentoring and training
  • Competitive compensation with geographic adjustments
  • Well‑rounded benefits, including:
      • Medical, dental, and vision coverage
      • 401(k) with company match and profit sharing
      • Paid parental leave and family‑building benefits
      • Generous PTO starting day one + paid holidays
      • Fully covered life, disability, and employee assistance programs


  • Site Reliability Engineer

    47 minutes ago


    Brookfield, United States Knak Digital Full time

    About the Role This Site Reliability Engineer III role is a senior, hands-on position serving as the PostgreSQL subject-matter expert across a modern, cloud-native data platform. You’ll play a critical role in database architecture, performance, reliability, security, and observability, partnering closely with application, security, and platform teams. What...


  • Town of Florida, United States Optomi Full time

    Overview This range is provided by Optomi. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more. Base pay range: $145,000.00/yr - $160,000.00/yr. Site Reliability Engineer Optomi, in partnership with a leading global media organization...


  • Brookfield, Wisconsin, United States Knak Digital Full time

    About the Role This Site Reliability Engineer III role is a senior, hands-on position serving as the PostgreSQL subject-matter expert across a modern, cloud-native data platform. You'll play a critical role in database architecture, performance, reliability, security, and observability, partnering closely with application, security, and platform teams. What...


  • Town of Texas, United States SS&C Technologies Full time

    Overview Site Reliability Engineer (SRE) at SS&C Technologies. Remote opportunities available in multiple states. SS&C Technologies is a global investment and financial services software provider with a long-standing presence and a broad client base. About the Role The Site Reliability Engineer (SRE) is responsible for leading technology teams to deliver...


  • Town of Texas, United States Longbridge Securities Full time

    Longbridge is a fast-growing online brokerage platform on a mission to make investing smarter, simpler, and more accessible for everyone. Overview We are looking for a hands-on Site Reliability Engineer (SRE) to design, scale, and safeguard the reliability of our next-generation financial platforms. This is a high-impact role where you’ll partner closely...


  • Town of Brookfield, United States Milliman Full time

    A leading consulting firm is seeking a Site Reliability Engineer III focused on PostgreSQL database management and optimization. This role involves architecting and maintaining databases on AWS, ensuring performance, security, and compliance with best practices. The ideal candidate should have over 5 years of experience in PostgreSQL, strong AWS service...


  • Town of Florida, United States SS&C Technologies Full time

    Job Description As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world’s largest companies to small and mid‑market firms, rely on SS&C for expertise, scale, and...


  • Town of Brookfield, United States Milwaukee Tool Full time

    Overview Milwaukee Tool is seeking a Senior Reliability Engineer to contribute to the development and qualification of power tool products. The Senior Reliability Engineer is a critical member of the New Product Development Team reporting to the Quality/Reliability Manager. The role requires strong cross-functional communication and analytical skills to...


  • Town of Texas, United States Medium Full time

    Job Position: Blockchain Site Reliability Engineer Location: Dallas, TX, USA (Remote Acceptable - USA Applicants Only) Company: Contact: About Company InfStones is an advanced, enterprise-grade Platform as a Service (PaaS) blockchain infrastructure provider trusted by the top blockchain companies in the world. InfStones’ AI-based infrastructure provides...


  • Town of Woodbury, United States ExamWorks Full time

    Join to apply for the Site Reliability Engineer (31143) role at ExamWorks. ExamWorks is looking for a Site Reliability Engineer to join the team. This role is Monday‑Friday onsite in the Woodbury, NY office, during standard business hours. Essential Functions Support the availability and performance across all production environments supporting...