Exam CCA-F Topic 1 Question 295 Discussion

Actual exam question for Anthropic's CCA-F exam
Question #: 295
Topic #: 1

When implementing a self-evaluation loop for structured data extraction, your agent reports 96% aggregate accuracy, but downstream users complain about frequent errors on complex contracts. What monitoring methodology resolves this evaluation visibility gap?

A. Tracking accuracy by specific document type and field segment (stratified metrics).

B. Increasing the overall sampling size randomly to capture a broader dataset.

C. Switching entirely to the Message Batches API for asynchronous processing.

D. Executing the evaluation passes inside a PreToolUse hook.

Exam CCA-F Topic 1 Question 295 Discussion

Comments