payslip-ingest/tests
Viktor Barzin 86cac65572 processor: skip non-payslip docs by title pattern
The Paperless 'payslip' tag has been applied over the years to P60 annual
summaries, performance/year-end letters, Compensation_EMEA/PSC letters,
comp-review letters, and RSU grant agreements. These are legitimate
financial docs but not monthly payslips, and including them pollutes
the dashboards (a P60 amount is ~12x a single month).

Filter by title regex before hitting Claude so we skip cheaply and
don't burn extraction credit on them. Status returned is
'skipped_non_payslip' to distinguish from the 'already-ingested' skip.

Covers: P60*, *performance*(letter|year-end)*, compensation_emea,
*psc*, comp-letter, rsu grant*. New parameterized tests cover both
the exclude list and representative real payslip titles.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-18 23:32:17 +00:00
..
__init__.py Initial commit: event-driven UK payslip ingest service 2026-04-18 22:10:23 +00:00
conftest.py Initial commit: event-driven UK payslip ingest service 2026-04-18 22:10:23 +00:00
test_extractor.py Initial commit: event-driven UK payslip ingest service 2026-04-18 22:10:23 +00:00
test_paperless.py Initial commit: event-driven UK payslip ingest service 2026-04-18 22:10:23 +00:00
test_processor.py processor: skip non-payslip docs by title pattern 2026-04-18 23:32:17 +00:00
test_schema.py Initial commit: event-driven UK payslip ingest service 2026-04-18 22:10:23 +00:00
test_tax_year.py Initial commit: event-driven UK payslip ingest service 2026-04-18 22:10:23 +00:00
test_webhook.py Initial commit: event-driven UK payslip ingest service 2026-04-18 22:10:23 +00:00