The CPU only version errors out on running as the code still believes it is running on a CUDA compatible device and not on the CPU.
Similarly, on opening the HuggingFace demo, I get a runtime error as memory has been exceeded with the error logs showing the same errors as those I get while running the CPU only version.