Do you use anything to archive content for yourself or others? (research, videos, articles, and anything that could be lost to time or censorship)

Otter@lemmy.ca · edit-2 5 days ago

Do you use anything to archive content for yourself or others? (research, videos, articles, and anything that could be lost to time or censorship)

Otter@lemmy.ca · 5 days ago

One option that I’ve heard of in the past

ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline.

Krafting@lemmy.world · 5 days ago

I archive youtube videos that I like with TubeArchivist, I have a playlist for random videos i’d like to keep, and also subscribe to some of my favourite creator so I can keeptheir videos, even when I’m offline

vividspecter@lemm.ee · 5 days ago

I’ll add pinchflat as an alternative with the same aim.

Ludrol@szmer.info · 5 days ago

https://wiki.archiveteam.org/

they have an automatic VM that dowloads stuff in distributed manner and uploads to archive.org

yasser_kaddoura@lemmy.world · edit-2 5 days ago

I have a script that archives to:

I used to solely depend on archive.org, but after the recent attacks, I expanded my options.

Script: https://gist.github.com/YasserKa/9a02bc50e75e7239f6f0c8f04fe4cfb1

EDIT: Added script. Note that the script doesn’t include archiving to archivebox, since its API isn’t available in stable verison yet. You can add a function depending on your setup. Personally, I am depending on Caddy and docker, so I am using caddy module [1] to execute commands with this in my Caddyfile:

route /add {
	@params query url=*
	exec docker exec --user=archivebox archivebox archivebox add {http.request.uri.query.url} {
		timeout 0
	}
}

[1] https://github.com/abiosoft/caddy-exec

WhyJiffie@sh.itjust.works · 4 days ago

isn’t this prone to a

 || rm -rf /

or something similar at the end of the URL?

if you can docker exec, you have a lot of privileges already, so be sure to make sure this is not a danger

catloaf@lemm.ee · 5 days ago

I don’t self-host it, I just use archive.org. That makes it available to others too.

Zachariah@lemmy.world · 5 days ago

It’s a single point of failure though.