Module assessment

1.

Which evaluation technique can you use to apply your own judgement about the quality of responses to a set of specific prompts?

Model benchmarks

Manual evaluations

Automated evaluations

2.

Which evaluator compares generated responses to ground truth based on standard metrics?

Coherence

F1 Score

Protected material

3.

Which evaluator metric uses an AI model to judge the structure and logical flow of ideas in a response?

Coherence

F1 Score

Protected material
