Use possessive quantifiers in excerpt helper regexps to prevent backtracking by zachasme · Pull Request #2738 · basecamp/fizzy

zachasme · 2026-03-20T13:10:36Z

Hopefully fixes Regexp timeouts for very large cards. See https://3.basecamp.com/2914079/buckets/27/card_tables/cards/9700847055

Also cleans up the format_html helper.

…racking Also cleans up the format_html helper.

Copilot

Pull request overview

This PR aims to reduce the risk of Regexp timeouts when generating excerpts from very large card contents by adjusting the excerpt-normalization regexes, and it also refactors the HTML formatting helper to encapsulate scrubber memoization.

Tip

If you aren't ready for review, convert to a draft PR.
Click "Convert to draft" or run gh pr ready --undo.
Click "Ready for review" or run gh pr ready to reengage.

Changes:

Update format_excerpt regex quantifiers to use possessive *+ in an attempt to reduce backtracking on large inputs.
Refactor format_html to call a private auto_link_scrubber helper for memoization instead of using an inline instance variable.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
app/helpers/html_helper.rb	Refactors scrubber memoization into a private helper method used by `format_html`.
app/helpers/excerpt_helper.rb	Adjusts excerpt-normalization regexes to reduce backtracking risk on very large strings.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-20T13:15:12Z

app/helpers/excerpt_helper.rb

+    text = text.gsub(/^>\s*+(.*)$/m, '> \1')
+    text = text.gsub(/^\s*+[-+]\s*+(.*)$/m, '• \1')
+    text = text.gsub(/^\d+\.\s*+(.*)$/m) { |m| m }


In Ruby, the /m flag makes . match newlines, so (.*) here can still span large portions of the string and may still trigger heavy backtracking on very large inputs (which this PR is trying to avoid). Consider removing /m and matching per-line explicitly (e.g., using a negated character class like [^\n]*+ / [^\r\n]*+, or making the trailing quantifier possessive) so the whole pattern stays linear-time on multiline content.

Suggested change

text = text.gsub(/^>\s*+(.*)$/m, '> \1')

text = text.gsub(/^\s*+[-+]\s*+(.*)$/m, '• \1')

text = text.gsub(/^\d+\.\s*+(.*)$/m) { |m| m }

text = text.gsub(/^>\s*+([^\r\n]*+)$/, '> \1')

text = text.gsub(/^\s*+[-+]\s*+([^\r\n]*+)$/, '• \1')

text = text.gsub(/^\d+\.\s*+([^\r\n]*+)$/) { |m| m }

Copilot · 2026-03-20T13:15:13Z

app/helpers/excerpt_helper.rb

-    text = text.gsub(/^\d+\.\s*(.*)$/m) { |m| m }
+    text = text.gsub(/^>\s*+(.*)$/m, '> \1')
+    text = text.gsub(/^\s*+[-+]\s*+(.*)$/m, '• \1')
+    text = text.gsub(/^\d+\.\s*+(.*)$/m) { |m| m }


This gsub is a functional no-op (the block returns the full match unchanged), but it still forces an extra regex scan/allocation on large strings. Either remove it, or change it to actually normalize ordered-list spacing (similar to the quote/bullet replacements) without re-copying the string unnecessarily.

Suggested change

text = text.gsub(/^\d+\.\s*+(.*)$/m) { |m| m }

Use possessive quantifiers in excerpt helper regexps to prevent backt…

3d54390

…racking Also cleans up the format_html helper.

Copilot AI review requested due to automatic review settings March 20, 2026 13:10

Copilot started reviewing on behalf of zachasme March 20, 2026 13:11 View session

Copilot AI reviewed Mar 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use possessive quantifiers in excerpt helper regexps to prevent backtracking#2738

Use possessive quantifiers in excerpt helper regexps to prevent backtracking#2738
zachasme wants to merge 1 commit intomainfrom
except-helper-possessive-quantifiers

zachasme commented Mar 20, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 20, 2026

Uh oh!

Copilot AI Mar 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

zachasme commented Mar 20, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants