See more Collapse

Lead Data Science Engineer

1 month ago


Remote, Oregon, United States Cybrary Full time

Who We Are

Cybrary is the world leader in developing, measuring, and improving the skills of every cybersecurity professional and business worldwide. We believe the key to closing the cybersecurity skills gap is to arm cyber professionals with the skills they need to continuously develop in their field. Our enterprise-grade platform and rapidly expanding catalog support multiple types of learning, and allow for a personalized training experience for each user on the platform. With over 3 million users, Cybrary is well-positioned to accelerate its leadership position in the marketplace.

At Cybrary we value teamwork, collaboration, and trust. We look for people who like to win, embody grit, operate with a growth mindset, who love to learn, and approach each challenge with an open mind. We welcome diverse perspectives, and those who demonstrate the passion needed to disrupt the online and cybersecurity training industry. Our culture is shaped by fearless communication, nitro cold brew energy, collaborative community, and an up-for-anything attitude.

Headquartered in College Park, MD, Cybrary has been recognized as one of Washington D.C.'s Best Places to Work by Forbes and the Washington Post, and made Deloitte's Technology #Fast500 as one of the fastest growing technology companies in North America.


Cybrary is seeking an experienced Data Engineering Lead to join our team and drive the development and maintenance of our data infrastructure, pipelines, and warehousing solutions. This role will be responsible for ensuring efficient and reliable data flow, enabling data-driven decision-making across the organization. Cybrary currently uses Looker and various models across the organization, democratizing data access throughout the company.

Responsibilities:

  1. Data Infrastructure Management:
  • Work with Operations to manage and optimize the cloud infrastructure (GCP) for data processing and storage.
  • Ensure high availability, scalability, and performance of the data infrastructure.
Data Ingestion and Pipelines:
  • Design and expand robust data ingestion pipelines to collect and process data from various sources (current pipelines mostly use Cloud Run).
  • Optimize data pipelines for performance, reliability, and maintainability.
Data Warehousing:
  • Manage and optimize the data warehouse solution (Snowflake) for efficient data storage and querying.
  • Manage and improve the data transformation process in Snowflake.
  • Ensure data quality, integrity, and governance within the data warehouse.
Team Leadership and Collaboration:
  • Lead the data team in improving data infrastructure, including the above systems, as well as semantic modeling (Looker) and machine learning models.
  • Collaborate with the data team, engineering, and other business stakeholders to understand core business needs and implement solutions.

Requirements:

  • Minimum of 5 years of experience in data engineering roles, with a strong background in cloud infrastructure management, data pipelines, and data warehousing.
  • Proficient in programming languages such as Python, Scala, or Java, and experience with data engineering tools like Apache Spark, Apache Kafka, and Airflow.
  • Expertise in cloud platforms such as AWS, GCP, or Azure.
  • Hands-on experience with data warehousing solutions like Snowflake, Redshift, or BigQuery.
  • Strong understanding of data modeling, ETL/ELT processes, and data quality assurance methodologies.
  • Excellent problem-solving, analytical, and communication skills.
  • Ability to work collaboratively in a cross-functional team environment.

Preferred Qualifications:

  • Experience in our core stack (Python, GCP, Cloud Run, Airflow, Snowflake, Looker).
  • Experience leading data engineering / data science teams.
  • Knowledge of DevOps practices and tools (Docker, Kubernetes, Terraform).
  • Familiarity with data governance, security, and compliance best practices.
  • Experience with streaming data processing and real-time data pipelines.

If you are a passionate data engineering professional with a strong technical background and leadership skills, we invite you to apply for this exciting opportunity. Join our team and play a crucial role in shaping and scaling our data infrastructure to drive business growth and success.
This position is 100% remote, however the candidate must currently live within the United States.


What We Bring to the Table

Eligible employees qualify for a competitive total rewards package inclusive of the following benefits:

  • Competitive salaries that align with market and industry standards
  • 100% coverage of medical, vision, and dental insurance premiums for employees
  • 50% premium coverage for dependents
  • Available HSA and FSA programs
  • Company-paid life insurance coverage
  • Company-paid short-term and long-term disability coverage
  • Flexible sick and vacation leave, with dedicated parent and bereavement leave policies
  • Birthday Leave
  • 401(k) plan
  • Company-paid student loan repayment after 6 months of service
  • Referral bonus plan
  • Professional Development and Training reimbursement package

Cybrary is proud to be an Equal Opportunity Employer, and does not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, physical or mental disability, national origin, veteran status or any other basis covered by applicable law.


We have other current jobs related to this field that you can find below


  • Remote, Oregon, United States Dotdash Meredith Full time

    About Your Role: Dotdash Meredith is the largest premium publisher in the world. Every day tens of millions of people come to us for help and inspiration. Our library of hundreds of millions of articles helps our users, and helps us understand the evolving needs of a world recovering from the pandemic and dealing with day-to-day challenges. As a director of...


  • Remote, Oregon, United States Dotdash Meredith Full time

    About Your Role: Dotdash Meredith is the largest premium publisher in the world. Every day tens of millions of people come to us for help and inspiration. Our library of hundreds of millions of articles helps our users, and helps us understand the evolving needs of a world recovering from the pandemic and dealing with day-to-day challenges. As a director of...

  • Data Science Actuary

    4 weeks ago


    Remote, Oregon, United States Counterpart Full time

    DATA SCIENCE ACTUARYCounterpart believes in small businesses and is dedicated to helping them do more with less risk. By pairing leading insurance experts with cutting-edge technology, Counterpart empowers small business owners to grow with confidence. Exceptional underwriters, trusted insurance brokers, and prominent insurance carriers come together on the...


  • Remote, Oregon, United States Dotdash Meredith Full time

    About Your Role: Dotdash Meredith is the largest premium publisher in the world. Every day tens of millions of people come to us for help and inspiration. Our library of hundreds of millions of articles helps our users, and helps us understand the evolving needs of a world recovering from the pandemic and dealing with day-to-day challenges. As a director of...


  • Remote, Oregon, United States Veda Data Solutions Full time

    Veda helps patients get the care they need by untangling complex data management problems using advanced scientific approaches and in-depth collaboration. Our technology reflects what our people provide: quality without ego, honesty backed by science, and warmth in an industry not known for having much heat. Veda is made up of talented professionals that are...


  • Remote, Oregon, United States Veda Data Solutions Full time

    Veda helps patients get the care they need by untangling complex data management problems using advanced scientific approaches and in-depth collaboration. Our technology reflects what our people provide: quality without ego, honesty backed by science, and warmth in an industry not known for having much heat. Veda is made up of talented professionals that are...

  • Data Engineer

    3 months ago


    Remote, Oregon, United States ResourceX Full time

    A Data Engineer Lead is responsible for optimizing our data and expanding the data pipeline infrastructure. Leads others in building products that will be consumed for data science artificial intelligence machine learning and other advanced analytic solutions. Works on cross-functional teams (Cloud Engineers Dev/Ops Engineers) to design and build...


  • Remote, Oregon, United States Appfire Full time

    Appfire builds next-generation enterprise collaboration solutions to liberate teams from silos and make work flow. By extending and enhancing what's possible on platforms like Atlassian, Microsoft, , Salesforce and more, Appfire enables companies to increase value from the many platforms they've invested in. Appfire empowers today's knowledge workers to plan...

  • Data Engineer

    4 weeks ago


    Remote, Oregon, United States M13 Full time

    Redefining Healthcare with CarenosticsAt Carenostics, we're at the forefront of healthcare AI, forging a path to address chronic diseases with transformative solutions. Our work, starting in Chronic Kidney Disease at Hackensack Meridian Health (HMH), has garnered prestigious accolades like the Bio-IT World Innovative Practices Award, placing us in the league...


  • Remote, Oregon, United States Teamshares Full time

    What is Teamshares?Teamshares is a mission-driven startup that buys small businesses from retiring owners and transitions them into enduring, employee-owned businesses through our software, education, and community products. There wasn't an easy way for small businesses—which make up 98% of firms in the US economy—to become employee-owned before...

  • Lead Data Scientist

    4 weeks ago


    Remote, Oregon, United States Protecht Full time

    Protecht is reinventing refunds, aiming to make every experience refundable. Our core strength lies in our proprietary Software-as-a-Service embedded refund protection platform, which delivers massive distribution and a best-in-class digital purchase experience to insurance carriers, event, booking, and ticketing platforms, and consumers.Role Overview:As the...

  • Data Engineer

    4 weeks ago


    Remote, Oregon, United States WireWheel Full time

    Data EngineerDesign, construct, and deploy data wrangling, processing, and analysis pipelines to extract operational intelligence from a variety of data sources using machine learning and data mining techniques for WireWheel, Inc.Write software programs and modify those written by others to perform specific data wrangling and transformation tasks.Analyze...

  • Lead Data Engineer

    7 days ago


    Remote, Oregon, United States Lumen Full time

    About LumenLumen connects the world. We are igniting business growth by connecting people, data and applications – quickly, securely, and effortlessly. Together, we are building a culture and company from the people up – committed to teamwork, trust and transparency. People power progress. We're invested in providing the flexibility you need to thrive...

  • Databricks Engineer

    4 weeks ago


    Remote, Oregon, United States NTT DATA Services Full time

    Databricks EngineerNTT DATA is a team of more than 190,000 diverse professionals, operating in more than 50 countries throughout the world. The sectors where we have activities include: telecommunications, finance, industry, utilities, energy, public administration and health.Our mission? Offer technological solutions, business, strategy, development and...

  • Lead Data Scientist

    4 weeks ago


    Remote, Oregon, United States Braviant Holdings Full time

    At Braviant, we believe in hiring great talent and offering them the flexibility to achieve great results unbounded by geography. Braviant is offering a fully remote option for anyone in the U.S. who wants to join our team and help us grow. We also have an office space in the heart of downtown Chicago for those who prefer to get out of the house and...

  • Data Engineer

    4 weeks ago


    Remote, Oregon, United States Jellyfish Full time

    As a member of Jellyfish Research, you will work closely with data scientists to understand and facilitate data needs for high-impact research, analysis and reporting. You will optimize, expand or redesign our current data platforms, including storage and pipeline architecture, to ingest, combine, and aggregate data from multiple sources. You'll also build...

  • Data Engineer

    4 weeks ago


    Remote, Oregon, United States Edmentum Full time

    WHO WE AREEdmentum is a dynamic educator and student-focused company dedicated to tech-enabled learning solutions. Our goal is to ensure that all students have access to flexible learning environments and educators have the tools they need to support their students. We are on a mission to create innovative, proven learning technology, partnering with...


  • Remote, Oregon, United States TrueML Full time

    Your Role:TrueAccord is looking for an Engineering Manager, Data Platform who will lead a team of data engineers to architect, develop, and maintain data pipelines using the latest technologies.Ideally, you have a proven track record in managing a group of world-class data engineers with demonstrated growth in their career development. You possess hands-on...

  • Senior Data Engineer

    4 weeks ago


    Remote, Oregon, United States Invoca Full time

    About Invoca:Invoca is the industry leader and innovator in AI and machine learning-powered Conversation Intelligence. With over 300 employees, 2,000+ customers, and $100M in revenue, there are tremendous opportunities to continue growing the business. We are building a world-class SaaS company and have raised over $184M from leading venture capitalists...

  • Senior Data Engineer

    4 weeks ago


    Remote, Oregon, United States Mediaocean Full time

    What You Will Do: As a member of the TVIQ team, you will work with a talented team developing a new cutting-edge linear and CTV media planning and buying platform with a heavy focus on data science and analytics. Your primary role will be the TVIQ team's data engineer and architect. You will be expected to make design decisions on how the platform functions...