- Position Type: Contract (6 Months, likely to extend or convert).
- Work Location: Onsite – Columbus, OH (must be local or willing to relocate by Day 1).
- Work Authorization: Must be authorized to work in the United States without sponsorship.
- Industry: Research and Advanced Technology.
- Design and optimize data pipelines serving LLM and Generative AI applications.
- Integrate Generative AI systems (e.g., OpenAI, Azure OpenAI, Anthropic, LLaMA, Mistral) with curated enterprise data sources.
- Develop and maintain retrieval-augmented generation (RAG) pipelines connecting structured/unstructured data to AI model contexts.
- Collaborate with data scientists, ML engineers, and AI researchers to ensure data readiness and model efficiency.
- Implement agentic system architectures using frameworks like LangChain, Semantic Kernel, or LlamaIndex.
- Apply best practices in AI security, data governance, and compliance to ensure responsible AI development.
- Automate LLM evaluation, fine-tuning, and deployment workflows while maintaining high system availability and accuracy.
- Proven experience as a Data Engineer or ML Engineer integrating LLM or Generative AI systems.
- Proficiency in Python, SQL, and distributed data frameworks such as Spark or Databricks.
- Strong understanding of RAG architectures and vector databases (e.g., Pinecone, Weaviate, Chroma, FAISS).
- Experience with orchestration frameworks such as LangChain, LlamaIndex, or Semantic Kernel.
- Understanding of AI security, data privacy, and prompt injection defenses.
- Experience working with Azure Databricks, Azure AI Services, or Azure OpenAI.
- Strong collaboration, problem-solving, and communication skills.
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
- Experience fine-tuning or customizing LLMs for enterprise use cases.
- Familiarity with MLflow, MLOps, or CI/CD for AI model deployment.
- Understanding of Delta Lake or medallion data architecture for AI-ready pipelines.
- Experience with streaming systems such as Kafka or Event Hubs.
- Contributions to open-source AI or LLM integration projects.
- Contribute to next-generation AI innovation within a globally recognized research and technology organization.
- Work hands-on with cutting-edge technologies—LLMs, Generative AI, RAG pipelines, and orchestration frameworks—to power real-world data intelligence systems.
- Collaborate with top data scientists and AI researchers in a mission-driven environment that values innovation and long-term impact.
- Onsite position in Columbus, OH with the potential for long-term extension or conversion to a permanent role.
Work Location: Hybrid remote in Columbus, OH 43219