Senior AI Engineer – LLM Pipelines & Automation

Company
Krisp
Category
Job Address
Application Deadline
IT
Yerevan, Armenia
17/04/2025
Responsibilities
- Own the end-to-end LLM pipeline, ensuring scalability, maintainability, and documentation - Design and optimize LLM inference pipelines for production, ensuring scalability and reliability - Profile and optimize model performance based on speed, cost, and compute resource utilization - Work with cloud-based AI services (AWS, GCP, Azure) to manage compute resources efficiently - Monitor and log model performance, identifying areas for optimization - Define and implement model evaluation metrics for tracking accuracy, latency, and cost efficiency - Automate LLM testing (e.g., hallucination detection, bias monitoring, and robustness checks) - Collaborate closely with the AI QA Engineer and Prompt Engineer to integrate testing, evaluation, and prompt design into the pipeline
Required Qualifications
- Strong Python and ML framework expertise (PyTorch, Hugging Face) - Deep understanding of LLMs (GPT, Claude, Mistral, etc.) and prompt engineering methodologies - Experience with vector databases and retrieval-augmented generation (RAG) - Experience in profiling and optimizing LLM performance (latency, cost, memory usage) - Strong knowledge of APIs and cloud AI infrastructure - Familiarity with LLM optimization, and deployment strategies is a plus - Ability to document pipeline architecture and workflows for cross-team collaboratio-n
Application Procedures
All interested candidates are encouraged to apply by sending their CV and additional details to talent@krisp.ai . We highly appreciate all applications, however, only shortlisted candidates will be contacted for the next stages. Please mention in your application that you have learned about this position from MyJob.am