We're on the hunt for a talented lead data engineer to spearhead our data projects and inspire our talented data team towards unparalleled innovation.
About ImportGenius
ImportGenius is the pioneer company in data analytics for the global import/export industry. Our trade data is used by import/export businesses the world over to gain an advantage over their competitors. The insights we produce using our trade data have been used by analysts and journalists to predict the iPhone launch even before the official announcement, track counterfeit money flowing into Venezuela, and most recently to discover priceless antiques being shipped out of Russia by oligarchs.
Duties & Responsibilities
- Use your creativity to find and extract useful insights from Global Trade Data
- Build and maintain scalable ETL pipelines that will perform complex transformations in a parallel fashion over large-scale datasets
- Work with stakeholders including the Executive, Product, Data, and Design teams to assist with data-related technical issues and support their needs.
- Conduct upskill or skill transfer training from time to time (including but not limited to Junior Data Engineers)
- Spearhead the implementation of industry-standard world-class practices in data engineering
- Help in the implementation and scalability of machine learning techniques in the processing of trade data
- Manage the day-to-day activities of the data engineers and conduct regular evaluation
- Help in the implementation and scalability of machine learning techniques in the processing of trade data
- Perform such other duties as customarily performed by a professional in other, same, or similar engagements.
Qualifications & Skills Required
Programming Language: Python
Data Processing Framework & Engine: Apache Spark
Data Stack: AWS Glue, S3, Lambda, Redshift, OpenSearch, Athena, QuickSight
Cloud Service Provider: AWS
DBMS: MySQL, MongoDB
- Minimum 3 years of professional experience in managing/leading a team
- Excellent written and verbal English communication skills
- Driven with an intrinsic motivation to succeed and continuously improve yourself and your surroundings
- Creative and able to find and build solutions to complex problems
- Experience designing and building data warehouses and associated topologies
- Strong experience with Relational, Non-Relational, and Columnar Databases
- Innovative “out of the box” thinking. This role is central to the company’s competitive advantage in providing unique insights using publicly available data
- Background in Statistical Analysis
- Ability to liaise with management and other teams
- A basic grasp of statistics and probability will be a plus
- Experience with Hadoop a plus
More about this role:
- Work Arrangement: Hybrid 2x a MONTH
- Work schedule: 7am - 4pm PHT
- Benefits: 13th month and leave credits
- Laptop shall be provided