Repeatability
High
The task is structurally identical every time: ingest, validate, deduplicate, export, report. There are no instance-specific judgment calls that change the shape of the solution.
Ambiguity Tolerance
High
Success criteria are concrete and verifiable — valid regex matches, no duplicate emails in output, accurate summary counts, and a readable output CSV. A non-human can confirm correctness by running the script against test data.
Data & Tool Availability
High
The agent only needs Python's standard library (csv, re, collections) plus optionally pandas or chardet for encoding detection — all freely available. No external APIs, credentials, or live systems are required.
Error Cost
Low
The script writes to a new output file, leaving the original CSV untouched. Any bugs are immediately visible in the output or summary report and trivially reversible by re-running a corrected version.
Human Judgment Required
Low
Regex patterns for email and phone are well-established standards, deduplication by email is unambiguous, and encoding/delimiter detection is a solved engineering problem. No taste, ethics, or relationship context is needed.