Job Title: Data Engineer
Experience: 3-5years
Education: Masters(preferred)/Bachelors in Computer Science, Mathematics, Statistics, Engineering, Information Technology or related
Core Skills:
- Strong Software Engineering background, in Java, RESTful APIs, any experience with Springboot is valued.
- Strong Python skills along with popular and widely used Python libraries, performing data cleansing, wrangling, augmentation,
Exploratory data analysis, building cloud data pipelines, data lineage, governance(bronze, silver, gold), Data Quality analysis, Featurization techniques, sampling.
- SQL, NoSQL Databases, Elasticsearch.
- Awareness of Vector databases, LM/LLM/GEN-AI based tools, libraries and frameworks(e.g; LangChain, Agentic, Semantic Kernel).
- Awareness of Client models like Random forest, KNN, Time series forecasting
- Awareness of Neural Network Architectures, Transformers, ANN, BERT, GPT models, Open Source foundational models(e.g; LLama2/3).
- Awareness/usage of mlFlow.
- Cloud(Azure is valued over AWS/GCP) based managed services(e.g; Azure OpenAI, Azure Client, Azure Data fcatory, ADLS GEN2).
- Awareness of Performance evaluation of Client/DL/LLM systems, drift handling
Experience with PySpark/Databricks is valued, not mandatory.
Non Technical skills: Team player with excellent communication and presentation skills. Self motivated.
Experience in aking LLM based systems or Neural Network based systems or mixture of experts based systems to production, is highly valued.
“All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.”