Fix OOM on large spooled result sets #590
gopinathnelluri wants to merge 1 commit into trinodb:master
Conversation
Thank you for your pull request and welcome to the Trino community. We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file. Continue to work with us on the review and improvements in this PR, and submit the signed CLA to cla@trino.io. Photos, scans, or digitally-signed PDF files are all suitable. Processing may take a few days. The CLA needs to be on file before we merge your changes. For more information, see https://github.com/trinodb/cla
@gopinathnelluri please let us know when you have submitted the CLA form.
Previously, the eagerness of TrinoQuery.fetch() caused all segments to load into memory at once when using fault-tolerant execution. This led to OOM errors on large datasets.

Changes:
- Enable lazy loading by returning SegmentIterator directly in fetch().
- Update execute() to handle result rows as iterators instead of requiring lists.
- Add unit test to verify lazy fetching implementation.
Force-pushed from f178dc3 to 049421b
damian3031 left a comment:
Great fix! I've added a few comments regarding style.
@hashhar should tests be rewritten to avoid using mocking library? (DEVELOPMENT.md discourages that). I can help with that if needed.
```python
if isinstance(self._result.rows, list) and len(self._result.rows) == 0:
    new_rows = self.fetch()
    if isinstance(new_rows, list):
        self._result.rows += new_rows
    else:
        # It's an iterator (spooled segments), replace rows with it
        self._result.rows = new_rows
        # We have an iterator now, so we can return result to user
        break
else:
    # We have data (list with items or an iterator), so return
    break
```
Suggested change:

```python
# Stop if we have a non-empty list or an iterator
if not isinstance(self._result.rows, list) or self._result.rows:
    break
new_rows = self.fetch()
if isinstance(new_rows, list):
    self._result.rows.extend(new_rows)
elif isinstance(new_rows, SegmentIterator):
    self._result.rows = new_rows
    break
else:
    raise TypeError(
        f"fetch() returned {type(new_rows).__name__}, expected list or SegmentIterator"
    )
```
This part could be made a bit more readable:
- The outer `else` can be avoided.
- Explicitly check the type of rows; raise an error if it is neither a list nor a SegmentIterator.
- `extend` is more idiomatic than `+=` for lists.
- Comments can be simplified a bit.
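For illustration, the suggested control flow can be sketched as a self-contained script. The `SegmentIterator` class and `block_until_rows` helper below are minimal hypothetical stand-ins, not the client's actual implementation:

```python
from types import SimpleNamespace

class SegmentIterator:
    """Minimal stand-in: lazily yields rows from spooled segments."""
    def __init__(self, segments):
        # Generator expression keeps segments unmaterialized until iterated
        self._rows = (row for segment in segments for row in segment)

    def __iter__(self):
        return self

    def __next__(self):
        return next(self._rows)

def block_until_rows(result, fetch):
    """Block until rows is a non-empty list or an iterator, per the suggestion."""
    while True:
        # Stop if we have a non-empty list or an iterator
        if not isinstance(result.rows, list) or result.rows:
            break
        new_rows = fetch()
        if isinstance(new_rows, list):
            result.rows.extend(new_rows)
        elif isinstance(new_rows, SegmentIterator):
            result.rows = new_rows
            break
        else:
            raise TypeError(
                f"fetch() returned {type(new_rows).__name__}, expected list or SegmentIterator"
            )
    return result.rows

# Spooled path: fetch() returns an iterator, rows stream lazily
spooled = SimpleNamespace(rows=[])
rows = block_until_rows(spooled, lambda: SegmentIterator([[1, 2], [3]]))
print(list(rows))  # [1, 2, 3]

# Direct path: keep fetching until a non-empty list arrives
responses = iter([[], [4, 5]])
direct = SimpleNamespace(rows=[])
print(block_until_rows(direct, lambda: next(responses)))  # [4, 5]
```

Note that the loop never materializes the iterator itself; it only replaces the empty list and breaks, leaving consumption to the caller.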
| """ | ||
| Execute should block until at least one row is received or query is finished or cancelled | ||
|
|
||
| For Standard Execution, rows is a list, we can check len. the first response usually contains no rows (just stats), | ||
| so we need to continue fetching until we get some rows or query is finished or cancelled. | ||
|
|
||
| For Spooled Execution, rows start as empty list and eventually fetch returns the rows as iterator, | ||
| we can't check len of an iterator easily without peeking. | ||
|
|
||
| So, if we get rows as non empty list or iterator, we stop blocking and return it to the caller to consume it. | ||
| """ |
This docstring is too verbose; it should be a short comment as it was previously, for example:

```python
# Execute should block until the query is finished or cancelled,
# or until at least one row is received (direct protocol),
# or an iterator is received (spooling protocol).
```
@findepi Submitted the signed CLA
Previously, the eagerness of TrinoQuery.fetch() caused all segments to load into memory. This led to OOM errors on large datasets.

Changes:
- Enable lazy loading by returning SegmentIterator directly in fetch().
- Update execute() to handle result rows as iterators instead of requiring lists.
- Add unit test to verify lazy fetching implementation.
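The iterator handling in execute() can be illustrated with itertools.chain: rows already buffered as a list are served first, then the remaining rows stream lazily. The `spooled_rows` helper and the segment data below are made up for illustration:

```python
from itertools import chain

def spooled_rows(segments):
    # Lazily yield rows segment by segment instead of materializing a list
    for segment in segments:
        yield from segment

# Hypothetical state: two rows buffered eagerly, two more still spooled
buffered = [(1, "a"), (2, "b")]
lazy = spooled_rows([[(3, "c")], [(4, "d")]])

# chain serves the buffered rows first, then streams the rest on demand
rows = chain(buffered, lazy)
print(list(rows))  # [(1, 'a'), (2, 'b'), (3, 'c'), (4, 'd')]
```

Because chain consumes its second argument only as the caller iterates, at most one segment's rows need to be resident in memory at a time.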
Description

This PR addresses a critical memory issue (OOM) encountered when fetching large result sets with fault-tolerant execution (spooling) enabled.

Previously, TrinoQuery.fetch() would materialize all spooled segments into a list immediately upon retrieval, even if user code was iterating row-by-row. For large datasets (e.g., 100M+ rows), this caused the client to consume excessive memory and crash.

Changes:
- Update TrinoQuery.fetch() to return a SegmentIterator directly instead of materializing it into a list when the fetch mode is standard.
- Update TrinoQuery.execute() to handle self._result.rows as an iterator (using itertools.chain) instead of assuming it is always a list.

Non-technical explanation
Fixed an issue where downloading very large datasets (100 million+ rows) when using fault-tolerant execution would cause the client to run out of memory and crash. The client now streams data efficiently instead of trying to load everything at once.
Release notes
(x) Release notes are required, with the following suggested text: