Looking at create_embeddings.py it appears that the code does not crawl what it finds in the site map. It seems to only use the URLs directly referred by the site map. If the site map has links to other site maps then the script does not work. See the example attached here.
sitemap_index.xml.txt
This should be made clear in the README.
Looking at
create_embeddings.pyit appears that the code does not crawl what it finds in the site map. It seems to only use the URLs directly referred by the site map. If the site map has links to other site maps then the script does not work. See the example attached here.sitemap_index.xml.txt
This should be made clear in the README.