加入 登录

Arfin Jahan

成为会员时间:2024

黄金联赛

997 积分
機器學習運作 (MLOps) 與 Vertex AI:模型評估 Earned Dec 22, 2025 EST
Model evaluation on Vertex AI Earned Dec 9, 2025 EST
Find, Explore and Deploy Model Garden Models Earned Dec 9, 2025 EST

本課程針對評估生成式和預測式 AI 模型,向機器學習從業人員介紹相關的基礎工具、技術和最佳做法。模型評估是機器學習的重要領域,確保這類系統能在正式環境中提供可靠、準確且成效優異的結果。 學員將深入瞭解多種評估指標與方法,以及適用於不同模型類型和工作的應用方式。此外,也會特別介紹生成式 AI 模型帶來的獨特難題,並提供有效的應對策略。透過 Google Cloud Vertex AI 平台,學員將瞭解在模型挑選、最佳化和持續監控方面,該如何導入穩健的評估程序。

了解详情

This course delves into the complexities of assessing the quality of large language model outputs. It examines the challenges enterprises face due to the subjective and sometimes incorrect nature of LLM responses, including hallucinations and inconsistent results. The course introduces various evaluation metrics for different tasks like classification, text generation, and question answering, such as Accuracy, Precision, Recall, F1 score, ROUGE, BLEU, and Exact Match. It also explores evaluation methods offered by Vertex AI LLM Evaluation Services, including computation-based, autorater, and human evaluation, providing insights into their application and benefits. Finally, the module covers how to unit test LLM applications within Vertex AI.

了解详情

Model Garden is a model library that helps you discover, test, and deploy models from Google and Google partners. Learn how to explore the available models and select the right ones for your use case. And how to deploy and interact with Model Garden models through the Google Cloud console and APIs.

了解详情