<h3>List of Respositories</h3>
I host some metadata about github's repositories. This includes a lot of basic data about the repository, but NOT the issues, any wiki, downloads, or the git repository:
<ul>
- <li>Full repository metadata is available in batches of 10,000 at <pre>http://za3k.com/github/repos-<X>0000-<X+1>0000.json
+ <li><p>Full repository metadata is available in JSON format. The format is explained on the <a href="https://developer.github.com/v3/repos/#list-all-public-repositories">github API</a>.</p>
+ <p>The files are available in batches of 10,000 at <pre>http://za3k.com/github/repos-<X>0000-<X+1>0000.json
http://za3k.com/github/repos-<X>0000-<X+1>0000.json.gz</pre>
To download all files, run <pre>
wget "http://za3k.com/github/repos-$((x*10000))-$(((x+1)*10000)).json.gz"; \
done
</pre>
- These files are around 10G compressed, 100G uncompressed. The format is explained on the <a href="https://developer.github.com/v3/repos/#list-all-public-repositories">github API</a>. Files are grouped by id; since some repositories are deleted or privated, each one contains significantly less than 10,000 repositories.
+ These files are around 10G compressed, 100G uncompressed. Files are grouped by github's internal id; since some repositories are deleted or privated, each file contains less than 10,000 repositories.
</li>
<li>You can grab greatly abbreviated metadata (recommended) as <a href="https://za3k.com/github/repos.json">JSON</a>. This includes the repository name and URL, a short description, whether it is a fork (and what of), and the approximate size of the repository.</li>
<li>Finally, you can get a txt file of just the repo names: <a href="https://za3k.com/github/repos.txt">txt</a>.</li>