machine learning +
LLM Evaluation: Build an LLM-as-Judge Pipeline
machinelearningplus.com
39 min
Gen AI
LLM Evaluation: Build an LLM-as-Judge Pipeline
Build an LLM evaluation pipeline in Python with LLM-as-judge scoring, rubric design, A/B testing, and regression alerts. Runnable code examples included.
