
Principal Data Scientist - Agent Builder
Lead technical direction for evaluating and scaling conversational search and agent quality (RAG, agents, tools, retrieval, citations, memory). Define evaluation strategies, metrics, and decision frameworks; prototype and productionize evaluation pipelines and telemetry; collaborate with engineering, product, and UX; mentor data scientists. Requires 8+ years of applied DS/ML experience, with expertise in IR, NLP, ranking, semantic search, and RAG/LLM systems, plus hands-on experience with Python, PyTorch/Transformers, and Elasticsearch.
