Remove supported entity field for context, exact terms, custom and pattern recognizers#25720
TypeScript types have been updated based on the JSON schema changes in the PR.
The Python checkstyle failed. Please run … You can install the pre-commit hooks with …
ingestion/src/metadata/pii/algorithms/presidio_recognizer_factory.py
99b6712 to 4145ff7
@gitar-bot review
🔍 CI failure analysis for 7847f87
Three distinct failures: (1) Python unit test across 6 runs (test needs fullyQualifiedName, PR-related), (2) Playwright E2E test on KPI widget (environmental/flaky), (3) Maven AwsCredentialsUtilTest errors (environmental AWS config, unrelated to PR).

Three Distinct CI Failures

Failure 1: Python Unit Test (PR-Related - Requires Fix)
Test: …
Error: …
Affected: 6 runs across Python 3.10 (5 runs) and 3.11 (1 run)
Root Cause: Test creates Tag without …
Fix: Add …
Status: Deterministic test bug

Failure 2: Playwright E2E Test (Environmental)
Job: 62785804406 - playwright-ci-postgresql (shard 2/6)
Results: 363 passed / 1 failed / 20 skipped
Test: Flow-CustomizeWidgets-KPI-Widget
Error: …
Status: Flaky E2E test unrelated to PR's backend changes

Failure 3: Maven Java Tests (Environmental - Unrelated)
Job: 62785836393 - maven-postgresql-ci
Results: 7,918 passed / 3 errors / 701 skipped
Failed Tests: All in …
Error: …
Assessment: AWS credential configuration tests unrelated to PR changes. This PR modifies: …
None affect AWS credential utility classes.
Status: Environmental infrastructure/configuration issue

Code Review ✅ Approved
7 resolved / 7 findings
Clean removal of the supportedEntity field from recognizer schemas, replacing it with tag FQN-based identification. Previous findings have been addressed or dismissed per user feedback. No new issues found.
✅ 7 resolved
✅ Edge Case: Fallback to "Unknown" entity type creates silent misclassification
✅ Quality: Parameter named …
This is already applied
This is on purpose. We don't care what the supported entities in the recognizer are, because our classification mechanism already maps tags to recognizers.
For one-off things like this I'd prefer not to extract for now. If I have to do this again, I'll do it in the TagDAO.
Describe your changes:
This removes fields we agreed we wouldn't need in recognizers anymore
Type of change:
Checklist:
Summary by Gitar
Removes the supportedEntity field from four PII recognizer schemas (PatternRecognizer, ContextRecognizer, ExactTermsRecognizer, CustomRecognizer), simplifying the configuration model.
The tag_fqn parameter is passed to Presidio recognizers as supported_entity.
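As a rough illustration of the idea in the summary, the sketch below shows a recognizer config that no longer carries a supportedEntity field, with the entity label supplied from the tag's fully qualified name when the recognizer is built. It is not the code from this PR: PatternRecognizerConfig, BuiltRecognizer and build_recognizer are hypothetical stand-ins (BuiltRecognizer is not Presidio's class), and "PII.Sensitive" is only an example tag FQN.

```python
import re
from dataclasses import dataclass, field
from typing import List


@dataclass
class PatternRecognizerConfig:
    """Hypothetical stand-in for the recognizer schema: no supportedEntity field."""
    name: str
    regexes: List[str] = field(default_factory=list)


@dataclass
class BuiltRecognizer:
    """Hypothetical stand-in for a Presidio recognizer, reduced to what the sketch needs."""
    name: str
    supported_entity: str
    patterns: List[re.Pattern]

    def matches(self, text: str) -> bool:
        # True if any configured pattern is found anywhere in the text.
        return any(p.search(text) for p in self.patterns)


def build_recognizer(config: PatternRecognizerConfig, tag_fqn: str) -> BuiltRecognizer:
    # The entity label comes from the tag FQN, not from the recognizer config.
    return BuiltRecognizer(
        name=config.name,
        supported_entity=tag_fqn,
        patterns=[re.compile(r) for r in config.regexes],
    )


# Example values below are assumptions made for illustration only.
config = PatternRecognizerConfig(name="email", regexes=[r"[\w.+-]+@[\w-]+\.[\w.]+"])
recognizer = build_recognizer(config, tag_fqn="PII.Sensitive")
```

Because the label is derived from the tag, the same config can be attached to a different tag without editing the recognizer definition, which matches the author's point that the classification mechanism already maps tags to recognizers.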