Exam CCA-F Topic 1 Question 295 Discussion
Actual exam question for Anthropic's CCA-F exam
Question #: 295
Topic #: 1
Question #: 295
Topic #: 1
When implementing a self-evaluation loop for structured data extraction, your agent reports 96% aggregate accuracy, but downstream users complain about frequent errors on complex contracts. What monitoring methodology resolves this evaluation visibility gap?
Suggested Answer: A Vote an answer
Aggregate accuracy metrics can mask severe per-document-type failures (e.g., contracts failing frequently while simple receipts succeed nearly 100% of the time). Tracking accuracy using stratified metrics (per document type and field) reveals these hidden failures before automating high-confidence extractions.
by Ruby at Jun 22, 2026, 02:57 PM
Comments
Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.
Report Comment
Commenting
You can sign-up / login (it's free).