dropbox Oct 2, 2025 A practical blueprint for evaluating conversational AI at scale (opens in new tab) llmnlpragprompt-engineeringconversational-aisynthetic-datallm-evaluationllm-as-a-judge