Sr. Cloud IT Reliability Engineer
2 weeks ago
Job DescriptionJob Description
Cloud Infrastructure Administrator
Role Overview
- As an AWS Systems Admin, you will be responsible for Site Reliability and Configuring, Administering, and Supporting AWS environments within the IT infrastructure/operations team in a Linux (Ubuntu) and Windows server environment
- Run the production environment by monitoring availability and taking a holistic view of system health
- Support systems to manage platform infrastructure and applications
- Improve reliability, quality, and time-to-market of software solutions in a cloud environment
- Measure and optimize system performance, with an eye toward pushing our capabilities forward and innovating to continually improve the cloud/infrastructure environment
- Provide primary operational support for multiple large, distributed software applications
- Implementing best practices, ensuring maintenance and backups are being completed, making recommendations for AWS services and automation opportunities.
- Monitor current AWS workloads and provide early warning to the business about impacted services and resolutions.
- Works on the infrastructure team but collaborates with the applications team and corporate Infrastructure teams to ensure business needs are being met and projects are completed in a timely manner.
- Ensure the environments are securely implemented and generating all necessary security logs.
Responsibilities include:
- Configuration and support EC2, S3, Autoscaling, CloudFront, CloudWatch, IAM security services as needed.
- Administration of the development, test and production cloud hosted environments.
- Provide issues/needs analysis and solution recommendation/implementation relative to system’s needs.
- Work directly with application engineers to identify and resolve issues
- Ensure system performance, uptime and support levels meet or exceed SLAs.
- Assist project team as needed with capacity planning and all AWS service and application deployments
- Manage virtual and physical cloud resources as required with an overall objective of improving the scalability, reliability, performance, and availability of the cloud infrastructure.
- Develop a detailed understanding of application functionality and architecture.
- Partner with application teams to develop practical monitoring solutions and participate in cross functional team meetings to collaborate and ensure successful executions.
- Maintain internal documentation that fully reflects all activity related to an application and environment to be used by applicable teams
- Troubleshoot and resolve operational issues, assisting with issues arising from product upgrades, installations, and configurations
- All other duties as assigned
Job Requirements
Education:
- Bachelor’s degree in Management Information Systems, Computer Science or related major, or equivalent experience required. Equivalent years of experience are defined as one year of professional experience for each year of college requested
Experience:
- 5+ years of experience operating in a production environment and experience with Systems support with either Linux (pref Ubuntu Linux), and / or Windows environments
- 3+ years of experience in Infrastructure and systems support and administration in a cloud or hybrid environment
- 2+ years of experience within Cloud Infrastructure configuration management, administration, & support
- 2+ years of experience with AWS (Azure experience is acceptable, provided there is some knowledge of AWS)
- Experience in AWS system administration and configuration management and knowledge of AWS versioning system.
- Experience with AWS services, e.g., EC2, CloudWatch, RDS, S3, EKS, VPC, etc. Support and configuration management activities including Cloud Infrastructure; Cloud Provisioning; Cloud Service Management; AWS Relational Database Service (RDS); Amazon VPC; Amazon S3; Amazon Web Services EC2; Amazon Web Services VPC; including Cloud back up and restoration processes.
- Experience with troubleshooting and resolving operational issues, assisting with issues arising from product upgrades, installations, and configurations.
- Experience of clustering, backup configuration and DR exercises
- Experience with AWS CLI (Command line interface) for automating administrative tasks
- Ability to work in a distributed team environment where team members are spread across numerous locations and often communicate virtually to support clients
- Strong written and verbal communications skills
- Basic Scripting experience (Any )
Skills & Certifications:
- Experience with Site Reliability Engineering (SRE)
- Experience with VPC, AZs, Subnets, Route53, CloudWatch, ALB/NLB, Security Groups, EKS, and EC2.
- AWS Certifications
- Experience working through a cloud transformation working with both on prem technologies and Public Cloud and ideally having helped move on prem technologies to the AWS Public Cloud
- Knowledge of networking concepts, e.g., OSI model, DNS, TCP/UDP, and IPv4/IPv6 and experience with AWS network connectivity configuration and network security (using security groups, keys etc.) and maintaining AWS network connectivity for standard Internet services such as VPC, VPN, DNS, NFS, DHCP and FTP.
- Automation, Orchestration & Provisioning; Container orchestration experience using AWS EKS clusters in a production environment.
- Remediate vulnerabilities/patching operating systems using AWS Systems Manager, creating hardened AWS AMIs, and other security related activities.
- Devops: Terraform; Docker; Ansible; Terraform, Ansible, Git, YAML JavaScript Object Notation (JSON);
- Supporting multiple continuous build environments, code repository administration, and code packaging and deployments to multiple development, QA and production environments.
Supervisory Responsibility:
- This position does not include supervisory responsibilities
FLSA Status
- This position is classified as Salaried Exempt, and is not eligible for payment of time-and-one-half the regular rate of pay for hours worked over 40 in a week.
Work Environment & Physical Requirements
Work Environment:
- This job operates in normal professional office environments, 2 to 5 days on-site, and routinely uses standard office equipment such as computers, phones, photocopiers and fax machines, and filing cabinets.
Physical Requirements:
- While performing the duties of this job the employee is regularly required to talk or hear; frequently sits, stands, walks, uses hands handle or feel; and reaches with hands and arms.
-
Cloud Reliability Engineer
3 days ago
Fort Bragg, North Carolina, United States Venatore Llc Full timeJob OverviewThe Venatore LLC is seeking a Cloud Site Reliability Engineer to join our team. This role will ensure the seamless operation of our cloud infrastructure, focusing on reliability, security, and scalability.We are looking for an experienced professional with expertise in distributed storage technologies, container orchestration, and cloud...
-
AWS Cloud Engineer
1 week ago
Windsor Mill, United States Omm IT Solutions Full timeJob OverviewOmm IT Solutions is seeking a seasoned AWS Cloud Engineer to join our team. As a key member of our cloud engineering team, you will be responsible for designing, implementing, and maintaining scalable, secure, and high-performance cloud architectures on AWS.About the Role:Design and implement cloud architectures on AWS.Analyze and troubleshoot...
-
Site Reliability Engineer
1 day ago
Fort Mill, United States Coforge Full timeJob Title: Site Reliability EngineerExperience: +10 Years Skills: .NET, SQL, React, Dynatrace, AWS, Splunk, Elastic Stack, Python, Scripting Languages, Ansible Tower, TerraformLocation: Fort Mill SCWe at Coforge are hiring for a Site Reliability Engineer with the following skills:Responsibilities:Lead development of SRE dashboard.Lead development and...
-
Site Reliability Engineer
2 days ago
Fort Mill, United States Coforge Full timeJob Title: Site Reliability EngineerExperience: +10 Years Skills: .NET, SQL, React, Dynatrace, AWS, Splunk, Elastic Stack, Python, Scripting Languages, Ansible Tower, TerraformLocation: Fort Mill SCWe at Coforge are hiring for a Site Reliability Engineer with the following skills:Responsibilities:Lead development of SRE dashboard.Lead development and...
-
Coforge | Site Reliability Engineer
3 days ago
fort mill, United States Coforge Full timeJob Title: Site Reliability EngineerExperience: +10 Years Skills: .NET, SQL, React, Dynatrace, AWS, Splunk, Elastic Stack, Python, Scripting Languages, Ansible Tower, TerraformLocation: Fort Mill SCWe at Coforge are hiring for a Site Reliability Engineer with the following skills:Responsibilities:Lead development of SRE dashboard.Lead development and...
-
Cloud Engineer II
2 weeks ago
Fort Mill, South Carolina, United States LPL Financial Full timeCompany OverviewLPL Financial is a leading independent broker-dealer, providing an integrated platform of proprietary technology, brokerage, and investment advisor services. Our mission is to offer objective financial guidance and support our advisors in delivering exceptional client experiences.Salary RangeThe estimated salary for this role is $37.16-$61.93...
-
Senior Cloud Engineer
3 days ago
Fort Mill, South Carolina, United States ZipRecruiter Full timeJob Title: Senior Cloud EngineerAbout the Role:We are seeking a highly skilled Senior Cloud Engineer to join our team at ZipRecruiter. As a key member of our IT infrastructure team, you will be responsible for designing, implementing, and managing our cloud infrastructure. Your expertise in cloud computing will enable us to deliver high-quality services to...
-
Senior Cloud Solutions Engineer
2 weeks ago
Fort Belvoir, United States SMX Corporation Full timeAbout Our OpportunityWe are seeking a highly skilled Sr. Cloud Solutions Engineer to join our team at SMX Corporation. As a key member of our cloud services group, you will be responsible for designing and implementing secure, scalable, and reliable cloud infrastructure solutions.Job SummaryDesign and implement cloud infrastructure solutions using commercial...
-
Windsor Mill, United States Omm IT Solutions Full timeOmm IT Solutions is seeking a highly skilled Cloud Infrastructure Engineer to design, administer, optimize, and secure Red Hat Enterprise Linux v6.x/7.x/8.x environments. The ideal candidate will have experience with automated deployment and configuration tools for RHEL PaaS, as well as proficiency in using Red Hat Satellite for deployment, management, and...
-
Senior Cloud Infrastructure Specialist
1 week ago
Windsor Mill, United States Omm IT Solutions Full timeAbout the Job:">We are seeking a highly skilled Senior Cloud Infrastructure Specialist to join our Omm IT Solutions team. This is an exciting opportunity for a talented individual to leverage their expertise in cloud infrastructure, engineering, and operations to drive success in our dynamic environment.">Job Responsibilities:">The ideal candidate will be...
-
Sr. Software Engineer, Full Stack
3 weeks ago
Fort Worth, United States Capital One Full timeSoftware Engineer, Full Stack (Cloud Operations Resilience Engineering)Sr. Software Engineer - Full Stack (Cloud Operations Resilience Engineering)Do you love building and pioneering in the technology space? At Capital One, you'll be part of a big group of makers, breakers, doers and disruptors, who solve real problems and meet real customer needs. We are...
-
Site Reliability Engineer
3 days ago
Fort Smith, United States Wizeline Full timeCommunity ImpactWe are proud to contribute to local economies by developing technology ecosystems in places like Mexico, Colombia, and Vietnam. We also created , a free, community-based education program that teaches high-value skills to workers looking to advance their tech industry careers. As of 2022, Academy has served more than 28,000 students across...
-
Senior Cloud Solutions Engineer
3 days ago
Fort Belvoir, United States SMX Corporation Full timeJob Description:\As a Sr. Cloud Engineer, you will play a critical role in leading the design, implementation and migration of client workloads into a streamlined multi-cloud environment.\About the Company:\SMX Corporation is a trusted provider of digital transformation solutions to government agencies and organizations.\About the Job:\\Lead the design,...
-
AWS Cloud Run Architect
1 month ago
Fort Mill, United States Lorven Technologies Full timePosition: AWS Cloud Run Architect Location: Fort Mill, SC Duration: Contract Job Description: AWS Cloud Engineer would be responsible for implementing & and maintaining cloud infrastructure and services. The role involves working with cloud providers, developing automation scripts, ensuring security and compliance, and collaborating with cross-functional...
-
Cloud Engineering Director
4 weeks ago
Fort Mill, South Carolina, United States LPL Financial Holdings, Inc. Full timeLPL Financial Holdings, Inc. is a leading independent broker-dealer with a strong commitment to innovation and customer satisfaction.We are seeking a seasoned Cloud Engineering Director to lead our mission to achieve software delivery velocity as a key competitive advantage in the market.The successful candidate will drive engineering solutions aligned with...
-
Technical Lead, Cloud Engineering
4 weeks ago
Fort Worth, Texas, United States Indotronix International Corporation Full timeAt Indotronix International Corporation, we are seeking a skilled Sr Developer, IT Application to join our team. This role is an exciting opportunity for a talented individual to work with cutting-edge technologies and contribute to the success of our organization.About the Role:We are looking for a highly motivated and experienced professional to lead our...
-
Cloud Network Architect
2 weeks ago
Windsor Mill, United States Omm IT Solutions Full timeAbout the RoleThis role involves working with cloud service providers to configure and provision networking and storage services. The successful candidate will be responsible for engineering and supporting the network design, deployment, and operations for a large government customer located in Baltimore, Maryland.Requirements5-8 years of experience with a...
-
Senior IT Solutions Engineer
1 week ago
Windsor Mill, United States Omm IT Solutions Full timeKey Responsibilities:* Design and implement scalable, secure, and high-performance cloud-based solutions on the AWS platform* Collaborate with cross-functional teams to identify and prioritize technical requirements* Develop and maintain comprehensive documentation of cloud architecture and infrastructure* Provide technical leadership and guidance to junior...
-
Network Engineer Level III
1 month ago
Windsor Mill, United States Omm IT Solutions Full timeJob Description Please Note: The client is looking for all candidates to be local to the Maryland area. Or, they should at least be located 90-minute drive from Windsor Mill, Durham and NC. Job Description: This Network Engineer will be working on innovative network design and operations. This position will support various customers within Health Solutions....
-
AWS Cloud Engineer
3 days ago
Fort Belvoir, United States Reflexive Concepts Full timeReflexive Concepts is seeking an experienced AWS Cloud Engineer to join our team in Ft. Belvoir, VA.Job OverviewWe are looking for a highly skilled professional with expertise in AWS, Linux, and big data solutions. The ideal candidate will have a solid understanding of configuration management, automation, scripting, and infrastructure implementation.Main...