
Explain text normalization in license matching documentation #4854

Open
kushal-p16 wants to merge 1 commit into aboutcode-org:develop from kushal-p16:patch-3


@kushal-p16

This PR adds documentation explaining how ScanCode handles text normalization during license matching, including handling of whitespace, punctuation, and case differences.

This helps clarify why similar license texts with minor formatting differences can still result in a match.
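As an illustration of the idea, here is a minimal sketch of this kind of normalization. It is not ScanCode's actual tokenizer: the `normalize` helper and the `_WORD` pattern are hypothetical names made up for this example, and the sketch assumes that matching compares sequences of lowercase alphanumeric tokens.

```python
import re

# Hypothetical pattern for illustration only; not ScanCode's real tokenizer.
# It keeps runs of ASCII letters and digits and drops everything else.
_WORD = re.compile(r"[a-z0-9]+")


def normalize(text: str) -> list[str]:
    """Lowercase the text and return its alphanumeric runs as tokens.

    Punctuation and any amount of whitespace (spaces, tabs, newlines)
    act only as separators, so they never appear in the result.
    """
    return _WORD.findall(text.lower())


original = "Permission is hereby granted, free of charge, to any person"
reformatted = "PERMISSION  is hereby granted free of charge\n  to any person."

# Both variants reduce to the same token sequence, so a comparison
# based on tokens treats them as the same text.
assert normalize(original) == normalize(reformatted)
```

Under these assumptions, differences in case, commas, trailing periods, and line wrapping disappear before comparison, while a change to an actual word would still produce a different token sequence.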

Fixes #4843

Add documentation explaining how ScanCode handles text normalization during license matching.

This includes handling of whitespace, punctuation, and case differences to ensure robust license detection even when text formatting varies.


Signed-off-by: KUSHAL P <kushalmys55@gmail.com>


Development

Successfully merging this pull request may close these issues.

Explain when and where comma, whitespace, etc. are ignored
