Position Overview
Job Description
Our client is looking for a LLM Engineer/Researcher. They have a DGX Cluster with 8 x H100s and are actively looking to fine tune, and eventually develop our own LLM models.
Responsibilities
● Train and Fine-tune foundational LLM models (e.g. using PEFT, Lora, QLora) to meet business needs
● Build and maintain LLM applications and infrastructure to meet business needs
● Design LLM inference infrastructure to scalably deploy LLMs within infrastructural constraints
● Research and utilize best of class tools within LLM ecosystem (e.g. Vector databases, LlamaIndex, etc)
● Keep up with latest research around LLMs (e.g. sparse models, hardware-specific LLMs)
● Research and keep up with latest use-cases of LLMs (e.g. RAG, Agents, etc)
● Collaborate closely with LLM research teams to participate in foundation model research, specifically for training productivity-related LLMs
...