Big Data
2 weeks ago
Location: Jersey City, NJ
Duration: Full-time
Job Description
Big Data (PySpark) Tech Lead
- 10+ years of overall experience in data management, data lakes, and data warehouses
- 6+ years of Hadoop, Hive, Sqoop, SQL, Teradata
- 6+ years of PySpark (Python and Spark), Unix
- Good to have: industry-leading ETL experience
- Banking Domain experience
Key Responsibilities
- Ability to design, build, and unit test applications on the Spark framework in Python.
- Build PySpark-based applications for both batch and streaming requirements, which requires in-depth knowledge of the majority of Hadoop and NoSQL databases as well.
- Develop and execute data pipeline testing processes and validate business rules and policies.
- Optimize performance of the built Spark applications in Hadoop using configurations around SparkContext, Spark SQL, DataFrames, and pair RDDs.
- Optimize performance for data access requirements by choosing the appropriate native Hadoop file formats (Avro, Parquet, ORC, etc.) and compression codecs.
- Ability to design and build real-time applications using Apache Kafka and Spark Streaming.
- Build integrated solutions leveraging Unix shell scripting, RDBMS, Hive, HDFS, and HDFS file types and compression codecs.
- Build data tokenization libraries and integrate them with Hive and Spark for column-level obfuscation.
- Experience in processing large amounts of structured and unstructured data, including integrating data from multiple sources.
- Create and maintain an integration and regression testing framework on Jenkins, integrated with Bitbucket and/or Git repositories.
- Participate in the agile development process, and document and communicate issues and bugs related to data standards in scrum meetings.
- Work collaboratively with onsite and offshore teams.
- Develop & review technical documentation for artifacts delivered.
- Ability to solve complex data-driven scenarios and triage defects and production issues.
- Ability to learn, unlearn, and relearn concepts with an open and analytical mindset.
- Participate in code release and production deployment.
- Challenge and inspire team members to achieve business results in a fast-paced and quickly changing environment.
Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.