Improve classification quality with self-critique, majority voting, and cross-project consistency by migeed-z · Pull Request #2841 · facebook/pyrefly

migeed-z · 2026-03-21T01:08:06Z

Summary:
The primer classifier has been producing inconsistent results across runs — the same primer diff can be classified as 'improvement' in one run and 'regression' in another. This was observed on real PRs like #2839 (altair TypeVar iterability) and #2764 (overload resolution, 60+ projects).

Three changes to improve reliability:

Self-critique pass (Pass 1.5): After Pass 1 produces reasoning, a new pass checks it for factual errors — e.g., claiming dicts are not iterable, incorrect inheritance claims, wrong TypeVar constraint analysis. This catches hallucinations before they reach the verdict pass. Tested on PR Avoid pinning solver variables too early when RHS is a union (#2839) #2839 where it correctly identified that both constraints of _C (list and TypedDict) are iterable.
Majority voting on verdict (Pass 2): Instead of a single verdict call, makes 5 independent calls and takes the majority. This reduces non-determinism where the same reasoning could be classified either way. Vote distribution is logged for transparency.
Cross-project consistency enforcement: After classifying all projects independently, groups them by error kind and enforces majority verdict within each group. This prevents the classifier from saying 'overload resolution improved' for one project and 'overload resolution regressed' for another with the same pattern.

Also upgrades the default Anthropic model from claude-opus-4-20250514 to claude-opus-4-6 for better Pass 1 reasoning quality.

Differential Revision: D97571454

Summary: I noticed when looking at the classifier output for facebook#2764 that the "verdict" formatting needed to be fixed. Two fixes: 1. formatter.py: Add _format_reason() to render JSON reason dicts as labeled readable sections (e.g. "**Spec check:** ...", "**Reasoning:** ...") 2. llm_client.py: Ensure reason is always a string by serializing dict values, so downstream code handles it consistently. Reviewed By: grievejia Differential Revision: D97422229

…nd cross-project consistency Summary: The primer classifier has been producing inconsistent results across runs — the same primer diff can be classified as 'improvement' in one run and 'regression' in another. This was observed on real PRs like facebook#2839 (altair TypeVar iterability) and facebook#2764 (overload resolution, 60+ projects). Three changes to improve reliability: 1. **Self-critique pass (Pass 1.5)**: After Pass 1 produces reasoning, a new pass checks it for factual errors — e.g., claiming dicts are not iterable, incorrect inheritance claims, wrong TypeVar constraint analysis. This catches hallucinations before they reach the verdict pass. Tested on PR facebook#2839 where it correctly identified that both constraints of `_C` (list and TypedDict) are iterable. 2. **Majority voting on verdict (Pass 2)**: Instead of a single verdict call, makes 5 independent calls and takes the majority. This reduces non-determinism where the same reasoning could be classified either way. Vote distribution is logged for transparency. 3. **Cross-project consistency enforcement**: After classifying all projects independently, groups them by error kind and enforces majority verdict within each group. This prevents the classifier from saying 'overload resolution improved' for one project and 'overload resolution regressed' for another with the same pattern. Also upgrades the default Anthropic model from claude-opus-4-20250514 to claude-opus-4-6 for better Pass 1 reasoning quality. Differential Revision: D97571454

meta-codesync · 2026-03-21T01:08:30Z

@migeed-z has exported this pull request. If you are a Meta employee, you can view the originating Diff in D97571454.

yangdanny97

Review automatically exported from Phabricator review in Meta.

migeed-z added 2 commits March 20, 2026 18:07

meta-cla bot added the cla signed label Mar 21, 2026

meta-codesync bot added fb-exported meta-exported labels Mar 21, 2026

yangdanny97 assigned migeed-z Mar 21, 2026

yangdanny97 approved these changes Mar 22, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve classification quality with self-critique, majority voting, and cross-project consistency#2841

Improve classification quality with self-critique, majority voting, and cross-project consistency#2841
migeed-z wants to merge 2 commits intofacebook:mainfrom
migeed-z:export-D97571454

migeed-z commented Mar 21, 2026

Uh oh!

meta-codesync bot commented Mar 21, 2026

Uh oh!

yangdanny97 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

migeed-z commented Mar 21, 2026

Uh oh!

meta-codesync bot commented Mar 21, 2026

Uh oh!

yangdanny97 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants