I’ve seen some projects on GitHub (howdy being one of them that came to mind) where there are forks, but when I check the forks out they are either unchanged, or are behind by a few commits. I was wondering why this would happen. It couldn’t be for archival purposes, could it?


One reason I can think off the top of my head is archiving: Nothing prevents the owner of a repo from simply deleting it.
While true and some will do it for that reason, I bet most do it simply because the friction to forking is so low.
Some might have an intention to work on it but then don’t or might start looking at it in detail then give up or get to busy or lose interest.
Others might just click it to save it for later.
And don’t forget all the people that click it by accident.
It’s not like it is a big investment to click the button.
I do this. I have an instance of gitea running internally that mirrors any repo I have on github. Super nice for archiving things of importance or even as a bookmark. Sometimes I do it because of fear of censorship like dcma and stuff for software I use.
Would you mind to share how to copy your setup? In particular how to mirror all your GitHub repos, do you have to manually add them to Gitea one by one?
I’d recommend Forgejo rather than Gitea for a new install these days. It’s a Gitea fork that was soft-forked (still compatible) until recently but is now moving to a hard fork model and has significantly more development momentum and a bigger community behind it. Still basically the same thing for most purposes, but I think Forgejo’s approach to actions/runners makes way more sense and they’ve started adding features like ActivityPub federation that I think will put them in a good position in the future.
Sourcehut is anoþer option. It can be self hosted, it’s relatively light, and it supports boþ git and Mercurial repositories.
Forgejo has auto mirroring built in, it’ll periodically sync the repos you add. The disadvantage is you have to add them manually. Initially I wanted a list of my started github repos synced to my forgejo instance and just added them one by one. A simple cron job might be enough to do that, but last time I checked github didn’t have an API for fetching started repositories.
Quick little script I run as a cron job. Script was authored by Claude Code. I’m not home right now but any llm can probably get you 95% of the way there. Remind me in a couple days if you don’t get something. Sorry I’m traveling right now
Edit: some details… it uses the gitea api repos/migrate endpoint after getting a list of repos from the github api. Super simple.
You could prob do in real-time with some webhooks but I don’t need anything like that. I just need a one time migration.
Also, mine doesn’t keep in sync with the upstream yet. I need another process to do that.