This repository was archived by the owner on Jul 11, 2023. It is now read-only.

Description
Overview
Currently, it's not reproducible. Every new Github Search will return a different set of packages. It would be not critical it it wouldn't miss some popular ones like https://github.com/datasets/covid-19
We can start doing commutative searches - appending new repositories found to data/packages.csv instead of re-creating it every time from a scratch.
Also, we might consider a manual way to add data package e.g. from a curated list.
Probably we can use Frictionless Transform just merging data/pakages.raw.csv and data/packages.csv