About the Role:
We are seeking a Machine Learning Engineer to work alongside a team of experienced ML engineers in building cutting-edge, efficient multimodal models for production. The role spans designing, implementing, training, and deploying these models in real-world applications. If you're excited about contributing to the future of efficient and versatile AI, this role is your chance to make an impact!
Responsibilities:
- Contribute to designing and shaping the fundamentals of the company’s core foundation model
- Implement and prove the efficacy of different ideas in an advanced in-house distributed training and evaluation framework
- Contribute to enhancing and augmenting the core training and evaluation infrastructure
- Build a suite of multimodal applications, including but not limited to media perception and generation (text, audio, video, image), chat interfaces, agents, and more!
- Work closely with executives to identify complex challenges and contribute to resolving them
- Continuously learn and expand your technical horizons
Requirements:
- Experience: You have 2-5 years of hands-on experience in the field of Machine Learning, with a particular focus on building and training models—ideally large language models and/or generative image/video models.
- Product Development: You've successfully built and deployed software products in a professional setting, demonstrating your ability to take ideas from concept to execution.
- Coding Excellence: Your Python skills are top-notch. You’ve written clean, well-architected, and test-covered code, with a deep understanding of the Python ecosystem, including frameworks like PyTorch or JAX.
- ML Experimentation: You have experience experimenting with ML models using the necessary tools and frameworks, bringing theoretical ideas into practical, working solutions.
- Startup Experience: You’re comfortable with the fast-paced, ever-evolving demands that come with early-stage startup environments.
- Expertise in the following technologies: training and evaluation of large language models (LLMs), proficiency in deep learning frameworks such as PyTorch or JAX, and exposure to multimodal models at scale
- Nice to have: experience developing low-level GPU kernels in CUDA or ROCm, exposure to low-resource model training and inference (e.g., quantization), and exposure to app and API development for AI solutions
We Offer:
- Year-end bonus
- Visa sponsorship
- Matched 401(k) plan
- Unlimited time off and sick days
- Founding member equity
- Industry-leading insurance (dental, medical, vision)