
Principal Data Scientist - Agent Builder
Lead the evaluation strategy and quality metrics for Elastic’s conversational and agentic platform: define offline and online evaluation, build evaluation pipelines, guide retrieval and ranking improvements, and productionize telemetry and CI guardrails. Requires 8+ years applied DS/ML experience in IR/NLP/ranking/semantic search/RAG, hands-on Python and PyTorch/Transformers, and practical Elasticsearch experience. Role includes prototyping, influencing roadmap, and mentoring cross-functional teams.







