NOTE: THIS POSITION IS TO JOIN AS W2 ONLY.
Data Analytics Engineer
Location: Hudson Yards, NY (Remote)
Duration: 3 Months
Project : Team is currently working on migrating data from Kinesis to Kafka. You will likely begin once migration has been completed or is on its final stages. Our team and you will need to troubleshoot for migration and post-migration (Notebooks, building SQL queries), migrate any unmoved date to the new system, support features for Client, and possibly anomaly detection.
Job Responsibilities / Typical Day In The Role
- Develop and maintain scalable data pipelines that process millions of records a day.
- Automate GitHub workflows, write unit/integration tests, contribute to engineering wiki, document work, help manage tech-debt and follow other engineering best practices.
- Build tools to enhance developer productivity.
- Implement processes, frameworks, and systems for anomaly detection, and to monitor data quality ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
- Work cross functionally with engineers, product managers, and analysts to help make data-driven decisions across the organization.
- Design and develop data integration framework.
- Design and evaluate open source and vendor tools for data lineage.
- Have an interest in learning and building ML models.
- Can flex software engineering, data engineering, data science and ML skills.
- Together with the rest of the engineering team, you will share an on-call rotation and be an escalation contact for service incidents.
- Ability to excel in a fast paced, startup-like environment.
- Strong Computer Science fundamentals.
- Excellent communication skills.
Must Have Skills / Requirements
- Proficiency in at least 1 coding language
- Java, python, C#, etc. Will need to be able to pick up code written in PySpark quickly if not previously known.
- Working knowledge of SQL
- Having experience working with any Relational Database: SQL server, Oracle, etc.
- Working with Open-Source Repository
- Github, Visual Studio Version Control, etc.
Nice To Have Skills / Preferred Requirements
- Worked with Databricks prior.
- Experience with ML (Machine Learning).
- Family oriented candidates.
- Candidates with interest in computer engineering.
- Candidates with traveling interests.
Soft Skills
Education / Certifications
- B/S or higher in Computer Science; or comparable (8+ YoE).
- 5+ YoE if candidate holds a degree.