Full Stack Engineer - GenAI & LLM
Experience: 5 - 20 Years
Location: North America - Permanent Remote
Must-Have
- Experience with Microsoft OpenAI Azure or Google Vertex
- Experience developing GenAI applications (doesn't have to be professional)
- End-to-end full-stack application development
- Experience developing API endpoints in Python or Java
- Experience with PyTorch and/or TensorFlow
- Experience working with multiple AI Frameworks (hugging face, semantic search, RAG, etc)
Platform / Stack
You will work with technologies that include Microsoft OpenAI Azure, Google Vertex, PyTorch, and RAG Architecture.
What You'll Do As a Sr Software Engineer
- Architect, design, and develop AI applications, integrating with Google Vertex, Microsoft OpenAI Azure, and other LLM suites
- Design and implement effective prompts, configure LLM settings, and optimize performance through prompt crafting, RAG, fine-tuning and other techniques
- Collaborate with cross-functional teams to define requirements, manage user expectations, and deliver high-quality AI solutions
- Develop and maintain API endpoints, front-end features, and full-stack applications that leverage LLMs and Generative AI models
- Implement AI applications that comply with ethical guidelines and legal standards, particularly regarding data privacy and user consent
- Integrate analytics and monitoring tools to track user interactions, application performance, and the efficiency of LLM integrations
- Mentor, motivate, and develop the technical capabilities of the existing engineering team
- Stay up-to-date with emerging trends and advancements in Generative AI, LLMs, and related technologies
Qualifications
You could be a great fit if you have:
- 8+ years of experience in full-stack software development, with a strong focus on building enterprise-scale distributed and cloud or hybrid-cloud applications
- Regarded as an expert in the growing field of AI with 5+ years of experience developing AI solutions and prototypes, including Generative AI and LLMs
- Experience with PyTorch, TensorFlow, ONNX, LangChain, Kubernetes, and Docker
- Deep understanding of AI frameworks including Huggingface, semantic search, RAG, LLM agents, AgentGPT, orchestration, plugins, and LLM Ops
- Experience with Retrieval-Augmented Generation (RAG) architectures or frameworks like Langchain for building LLM-powered applications
- Proficiency in programming languages such as Python, Java, JavaScript, and experience with frameworks like React and Node.js
- Experience with cloud platforms such as Google Vertex, Microsoft OpenAI Azure, AWS, and Azure, using various solutions for developing integrations, APIs, and AI/ML applications
Skills: api endpoints in python,cloud platforms,distributed and cloud,retrieval-augmented generation (rag) architectures,enterprise-scale distributed and cloud or hybrid-cloud applications,langchain,microsoft,full-stack application development,generative ai,api endpoints,pytorch,node.js,llm-powered applications,ethical guidelines,semantic search,llm integrations,hybrid-cloud applications,distributed and cloud application development,programming languages,cloud platforms (google vertex, microsoft openai azure, aws, azure),analytics and monitoring tools,cloud platforms integration,kubernetes,python,tensorflow,java,full-stack software development,onnx,proficiency in programming languages,api,end-to-end full-stack application development,ai/ml applications,microsoft openai azure,vertex,google vertex,rag,genai applications,llms,docker,cloud,api endpoints in java,api development,hugging face,llm suites,cloud or hybrid-cloud applications,react,javascript,google,programming languages (python, java, javascript),azure,ai frameworks,ai solutions and prototypes,enterprise-scale distributed and cloud applications,legal standards,huggingface,aws