-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Description
Hi,
This is in relation to my use of this package for feeding a training loop for word2vec, but I think the problem lies here, so I'm posting the issue here. Here's an associated gist with the relevant trace and code snippets.
Based on the traceback, here's what I think is happening:
- the producer queue of
macrobatch_queueeventually becomes unable toput((signal_examples, noise_examples))into the underlying_work_queueof themacrobatch_queueproducer. This causes the BrokenPipeError - the iterable for the producer queue of
macrobatch_queueis somehow still open whengenerate_dataset_parallelfinishes, leading to an EOFError (I think, see here)
So, I'm going to try closing macrobatch_consumer after the final for loop in the generate_dataset_parallel to see if it resolves [2], and inserting a short wait after the put in generate_dataset_worker to see if it fixes [1] (see here)
Have you ever seen this behaviour in your own experiments? Thanks in advance,
Metadata
Metadata
Assignees
Labels
No labels