refactor recursive comment grammar rules with external scanner #8

joshuadavidthomas · 2025-10-06T19:43:13Z

I've been working on a new language extension for the Zed editor for Django templates. I tried using this tree-sitter grammar, but kept running into crashes. The logs from the editor were no help, but after some fumbling around I narrowed it down to the comment rules.

Both unpaired_comment and paired_comment use recursive patterns that I think are the cause of the issue, possibly because Zed extensions get compiled to WASM (though that's just a hunch, no concrete evidence that's the core issue).

The problematic patterns:

unpaired_comment: repeat(seq(alias($.unpaired_comment, ""), repeat(/.|\s/)))
paired_comment: repeat(seq(alias($.paired_comment, ""), repeat(/.|\s/)))

To fix this, I made two changes, one small to unpaired comments and one large to paired comments.

For unpaired comments, I changed to a simple token() pattern -- Django just ignores everything between {# and #}, so no recursion needed.

For paired comments, I added an external C scanner inspired by tree-sitter-liquid, but took a different approach to preserve the original parsing behavior. The scanner uses depth tracking to find the balanced closing {% endcomment %}, incrementing depth when it sees nested {% comment %} tags and decrementing for {% endcomment %}. This maintains the exact same tree structure as the original grammar (single comment node), just without the recursive patterns that caused crashes.

interdependence · 2025-10-11T12:24:25Z

Interesting. It's been a while since I touched this project, but I specifically designed the original implementation to avoid having to write a scanner. If implemented for comments, I'm thinking it might also be a good idea to consider using a similar technique for paired statements in general. I will test this out. Just to clarify, this fixed your Zed extension issues?

fix recursive comment grammar rules with external scanner

0c07bfd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor recursive comment grammar rules with external scanner #8

refactor recursive comment grammar rules with external scanner #8

Uh oh!

joshuadavidthomas commented Oct 6, 2025 •

edited

Loading

Uh oh!

interdependence commented Oct 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

refactor recursive comment grammar rules with external scanner #8

Are you sure you want to change the base?

refactor recursive comment grammar rules with external scanner #8

Uh oh!

Conversation

joshuadavidthomas commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

interdependence commented Oct 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

joshuadavidthomas commented Oct 6, 2025 •

edited

Loading