Data Engineer, Research

1 week ago


Houston, United States AEG Full time
In order to be considered for this role, after clicking "Apply Now" above and being redirected, you must fully complete the application process on the follow-up screen.

Department: Baseball Operations, Research and Development
Supervisor: Sr. Director, Research and Development
Classification: Full-Time/Exempt
Location: Houston, TX

The Houston Astros baseball organization is accepting applications for a Data Engineer to join our Research & Development team within Baseball Operations. We are seeking an applicant to support the growth of our data architecture using cloud-based data lake technologies. This role will work within a team of software developers supporting the broad need of Baseball Operations and will be central to the workflow of departments across the organization, including opportunities to interface with and understand the needs of other departments and drive creative solutions.

Essential Functions & Responsibilities
Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
  • Collaborate with the team on the design and implementation of a cloud-based data architecture.
  • Leverage Spark-based solutions to develop and maintain data processing pipelines that provide efficient access to data at various stages of transformation.
  • Integrate structured, semi-structured, and unstructured data sources, handling various formats including Parquet, JSON, and more.
  • Automate workflows and monitoring procedures to promote a maintainable infrastructure.
  • Write clean and iterative code and leverage continuous integration practices to deploy, support and operate data pipelines.
  • Interact with stakeholders both internal to R&D (research analysts, application developers) and external to it to understand what they need from our architecture and data.
  • Participate in a rotating on-call schedule to tend to any immediate issues with our architecture and data.
  • Perform other duties as assigned.


Qualifications
  • Experience with one or more cloud platforms such as Azure, AWS, GCP.
  • Experience building and maintaining ETL processes with Databricks, Snowflake, or other data lake technologies.
  • Experience with Apache Spark (especially PySpark) a plus.
  • Proficiency with Python, including best practices and OOP design.
  • Proficiency with SQL and relational database structures.
  • Experience working on software teams and promoting software development best practices, including continuous integration, documentation, process automation, and monitoring.
  • Resilient in evolving environments and advocates for technical excellence.


Work Environment
This job operates in an office setting. This role routinely uses standard office equipment such as computers, phones and photocopiers. The noise level is usually moderate but can be loud within the stadium environment.

Physical Demands
While performing the duties of this job, the employee is occasionally required to stand; walk; sit (for long periods of time); use hands to handle or feel objects, tools or controls; reach with hands and arms; climb stairs; talk or hear. The employee may occasionally lift or move equipment, up to 20 pounds. Specific vision abilities required by this job include close and focused vision.

Position Type and Expected Hours of Work
Ability to work a flexible schedule, including extended hours, evenings, weekends, and holidays.

Travel
Some travel may be expected in this role.

Other Duties
Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities and activities may change at any time with or without notice.

We are an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law.

EOE/M/F/Vet/Disability
Experience
Preferred
  • 5


  • Houston, United States Houston Methodist Full time

    Overview At Houston Methodist, the Research Data Engineer position is responsible for building and managing the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of clinical and population-health data sources using big data technologies in partnership with Houston Methodist IT. The position entails...


  • Data Engineer

    1 week ago


    Houston, United States SysMind Tech Full time

    Title: Data Engineer Location: Houston, TX (On-site) Client: Quest Global/Halliburton Roles: This role is a part of the Production Enhancement Data Backbone team. The team, as well as a successful candidate, will create useful data products to be used in operations, data science modeling, business intelligence, and reporting. Overview: The candidate...


  • Houston, Texas, United States Dimensional Fund Advisors, L.P. Full time

    About Dimensional Fund Advisors, L.P. We're seeking a highly skilled Senior Data Engineer to join our team. As a Senior Data Engineer at Dimensional, you'll be responsible for designing and developing robust, secure, and maintainable data architectures, pipelines, and services. Key Responsibilities: Design and engineer data solutions across various domains,...


  • Houston, Texas, United States MD Anderson Cancer Center Full time

    Company Overview: The University of Texas MD Anderson Cancer Center is a world-renowned cancer research and treatment center. Our institution has the potential to unlock the power of data by further developing and investing in talent, team science, and infrastructure to optimize multidimensional data integration, analysis, and application for the benefit of...

  • Data Engineer

    2 months ago


    Houston, United States Jobot Full time

    Job Description: 100% Remote, Silicon Valley Health & Wellness start-up, Salary + Equity. This Jobot Job is hosted by: Duran Workman. Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume. Salary: $70,000 - $85,000 per year. A bit about us: We are currently seeking a highly skilled and motivated Data Engineer to join...


  • HOUSTON, United States Sysco Full time

    Essential Functions: • Manage all aspects of the data and analytics system from stream configuration to ETL to aggregate tables and cubes for reporting needs. Establish and maintain the data pipeline architecture. Anticipate and understand the impact of technical decisions across the organization. • Partner with all facets of the business to include...


  • HOUSTON, United States US6469 Sysco Payroll, Division of Sysco Resources Services, LLC Full time

    Essential Functions: • Manage all aspects of the data and analytics system from stream configuration to ETL to aggregate tables and cubes for reporting needs. Establish and maintain the data pipeline architecture. Anticipate and understand the impact of technical decisions across the organization. • Partner with all facets of the business to include...


  • Houston, United States Sysco Full time

    Essential Functions: • Manage all aspects of the data and analytics system from stream configuration to ETL to aggregate tables and cubes for reporting needs. Establish and maintain the data pipeline architecture. Anticipate and understand the impact of technical decisions across the organization. • Partner with all facets of the business to include...


  • Houston, United States Amerit Consulting Full time

    Our client, a US Fortune 250 company and a global Medical technology corporation serving customers in Clinical Labs, Health care research & Pharmaceutical industry, seeks a Lead Cloud Data Engineer for a 100% remote role. Key Responsibilities: Lead a team of data engineers in designing, building, and maintaining high-performance software systems. Architect,...

  • Data Engineer

    1 week ago


    Houston, United States Apex Health Solutions Full time

    Description Summary: The Data Engineer will support the implementation of projects focused on collecting, aggregating, storing, reconciling, and making data accessible from disparate sources to enable analysis and decision-making. In this role, the Data Engineer will play a critical role in connecting our partners and enabling our critical solutions using...

  • Research Engineer I

    1 month ago


    Houston, United States Houston Methodist Academic Institute Full time

    At Houston Methodist, the Research Engineer I position is responsible for providing technical support for laboratory-based research. This position, under the direction of a Principal Investigator (PI), may support bench techniques, working with laboratory animals, scientific writing, and data analysis. The Research Engineer I position may oversee equipment...


  • Houston, Texas, United States MD Anderson Full time

    About the Role: MD Anderson, a world-renowned institution in cancer research and treatment, seeks highly motivated and enthusiastic Data Scientists Assistants to join its esteemed team. These individuals will play a vital role in the analysis and development of algorithms and software for clinical, operational, and scientific problems. Key...


  • Houston, Texas, United States Houston Journal of Health Law & Policy Full time

    Company Overview: Houston Journal of Health Law & Policy is a leading organization dedicated to advancing the understanding of health law and policy. About the Role: We are seeking an exceptional Data Analyst and Research Coordinator to join our team. As a key member of our research team, you will play a crucial role in supporting our mission by...

  • Research Engineer I

    3 months ago


    Houston, United States Houston Methodist Academic Institute Full time

    At Houston Methodist, the Research Engineer I position is responsible for providing technical support for laboratory-based research. This position, under the direction of a Principal Investigator (PI), may support bench techniques, working with laboratory animals, scientific writing, and data analysis. The Research Engineer I position may oversee equipment...

  • Data Engineer

    1 week ago


    Houston, United States TEKsystems Full time

    *Description:* Top Skills' Details 1. Snowflake 2. SQL 3. Upstream Oil & Gas - operations & production, GGRE, wells, supply chain 4. Work hand-in-hand with stakeholders to migrate, integrate, and test all of the data across the industry This project is to impact the validation & migration of data within operations & production, maintenance and reliability,...


  • Houston, Texas, United States MD Anderson Cancer Center Full time

    About the Role: We are seeking an exceptional Data Scientist to join our team at MD Anderson Cancer Center. As a key member of our research laboratory, you will play a critical role in analyzing single-cell multiome and spatial transcriptomics data to uncover novel insights into cancer biology. Key Responsibilities: Develop and implement computational pipelines...

  • Data Engineer

    1 week ago


    Houston, United States Chord Energy Full time

    Position Summary Chord Energy is looking for an experienced Data Engineer to join our Data Solutions Team. This individual must have a strong foundation in data engineering and experience building data analytics solutions. The successful candidate will work with Microsoft Azure Data Factory, Airbyte, DBT or similar data pipeline tools, and will have...