Current jobs related to Python Pyspark - columbus - Centraprise

  • Python Spark Developer

    4 months ago


    Columbus, United States Diverse Lynx Full time

    Job Role : Python Spark Developer Location : Columbus, OH Experience : 9+ Years Responsibilities: Develop and maintain data platforms using Python, Spark, and PySpark. Handle migration to PySpark on AWS. Design and implement data pipelines. Work with AWS and Big Data. Produce unit tests for Spark transformations and helper methods. Create...


  • Columbus, United States Diamondpick Full time

    Duties and responsibilities • Collaborate with the team to build out features for the data platform and consolidate data assets • Build, maintain and optimize data pipelines built using Spark • Advise, consult, and coach other data professionals on standards and practices • Work with the team to define company data assets • Migrate CMS' data...


  • columbus, United States Pyramid Consulting, Inc Full time

    Immediate need for a talented Junior/ Mide-level Data Analyst. This is a 06+months contract opportunity with long-term potential and is located in Columbus OH (Remote). Please review the job description below and contact me ASAP if you are interested. Job ID:24-51261 Pay Range: $35 - $45/hour. Employee benefits include, but are not limited to, health...


  • columbus, United States Pyramid Consulting, Inc Full time

    Immediate need for a talented Junior/ Mide-level Data Analyst. This is a 06+months contract opportunity with long-term potential and is located in Columbus OH (Remote). Please review the job description below and contact me ASAP if you are interested. Job ID:24-51261 Pay Range: $35 - $45/hour. Employee benefits include, but are not limited to, health...


  • Columbus, United States Pyramid Consulting, Inc Full time

    Immediate need for a talented Junior/ Mide-level Data Analyst. This is a 06+months contract opportunity with long-term potential and is located in Columbus OH (Remote). Please review the job description below and contact me ASAP if you are interested. Job ID:24-51261 Pay Range: $35 - $45/hour. Employee benefits include, but are not limited to, health...


  • Columbus, United States Pyramid Consulting, Inc Full time

    Immediate need for a talented Junior/ Mide-level Data Analyst. This is a 06+months contract opportunity with long-term potential and is located in Columbus OH (Remote). Please review the job description below and contact me ASAP if you are interested. Job ID:24-51261 Pay Range: $35 - $45/hour. Employee benefits include, but are not limited to, health...

  • Senior Data Engineer

    6 months ago


    Columbus, United States g2o Full time

    Your future starts here Imagine being part of a team that helps clients build better relationships with customers. When you join us, you'll help top-notch clients to execute the digital strategies of the future. Every day, we collaborate with clients and each other to provide technology expertise, human-centered design and industry experience to...


  • Columbus, United States Huntington National Bank Full time

    Description Summary: Huntington Bank is looking for a Lead Business Systems Analyst (BSA) in our Enterprise Data Warehouse (EDW). In this role you will be part of a team working to develop solutions enabling the business to leverage data as an asset at the bank. As a Lead BSA Analyst, you will work with the business to understand their needs,...

Python Pyspark

1 month ago


columbus, United States Centraprise Full time

Job Title : Python Pyspark

Job Location : Columbus, OH (Fully ONSITE from Day1)

Job Type : Long term Contract-W2


Job Description:

Position summary

A Data Engineer at CMS is a software engineer with proficiency in data. The data engineer will

build and maintain the CMS data warehouse which is used for both reporting and analytics

across the company. The individual works cross functionally with technical and business teams

to identify opportunities to better leverage data. The data comes from a variety of sources and it

is the responsibility of the data engineer to make sense of the data using cloud based systems

(AWS) and provide a reliable and structured format to meet the different business needs at

CMS.

Duties and responsibilities

● Collaborate with the team to build out features for the data platform and consolidate data assets

● Build, maintain and optimize data pipelines built using Spark

● Advise, consult, and coach other data professionals on standards and practices

● Work with the team to define company data assets

● Migrate CMS’ data platform into Chase’s environment

● Partner with business analysts and solutions architects to develop technical

architectures for strategic enterprise projects and initiatives

● Build libraries to standardize how we process data

● Loves to teach and learn, and knows that continuous learning is the cornerstone of every

successful engineer

● Has a solid understanding of AWS tools such as EMR or Glue, their pros and cons and

is able to intelligently convey such knowledge

● Implement automation on applicable processes

The ideal candidate

● 5+ years of experience in a data engineering position

● Proficiency is Python (or similar) and SQL

● Strong experience building data pipelines with Spark

● Strong verbal & written communication

● Strong analytical and problem solving skills

● Experience with relational datastores, NoSQL datastores and cloud object stores

● Experience building data processing infrastructure in AWS

● Bonus: Experience with infrastructure as code solutions, preferably Terraform

● Bonus: Cloud certification

● Bonus: Production experience with ACID compliant formats such as Hudi, Iceberg or

Delta Lake

● Bonus: Familiar with data observability solutions, data governance frameworks

Requirements

Bachelor’s Degree in Computer Science/Programming or similar is preferred

Right to work

Must have legal right to work in the USA.