A friendly reminder to everyone to check out ArchiveBox if you're looking for a self-hosted archiving solution. I've been using it for a while now and it works great; it can be a little rough around the edges at times, but I think it's a wonderful tool. It's allowed me to continue saving pages during the Internet Archive's outage.
I like ArchiveBox, but in my experience, it kept on running into issues saving pages, and stopped functioning after it worked the first few times. I really wish there was a more streamlined application that did a similar thing somewhere out there.
I've been looking at Linkwarden's page archiving solution, but it crashes whenever I try importing any large number of links, so that's a bust too.
What sort of content are you archiving? I was trying to back up Wikipedia at some point and it was just a nightmare.
“belongs to the USA, and as we all know, this horrendous and hypocritical government supports the genocide that is being carried out by the terrorist state of ‘Israel.’”
Now that's a false-flag if I ever saw one. Out of all the sites of arms manufacturers, funding operations and troll-farms that directly support Israel, these creeps attack... the Internet Archive?
Tell me another one.
100%
We should treasure it and protect it at all costs. This is also for historical purposes.
This is a concerted effort to change our collective consent https://www.pbs.org/independentlens/documentaries/recorder-the-marion-stokes-project/
I don't know what kind of architecture web.archive.org has, but when it was offline, I thought that we should really have something distributed that would allow people to store and host a copy of all websites that are important for them.
IPFS seems similar to what you're looking for.
(See: A copy of Wikipedia on IPFS being censorship-resistant, and globally distributed)
Doesn't i2p do something similar to this? I don't know much about it but I remember reading it and thinking that it's like bittorrent but no one person has the entire file, or something like that.
Didn't you mean IPFS? I2P is a mixnet like the Tor network.
100PB on i2p is a funny idea, but it's not necessarily a bad one.
TIL it's I2P and not L2P
You can save the wacz files?
They've also restored the ability to play audio files, like archived old podcasts and stuff. Which is nice.
Still, fuck the people that randomly decided to harm Internet Archive. It really came close to being a digital Burning of the Library of Alexandria moment.
It surely wasn't random. There are people out there who don't want you to have anything for free.
Interesting how the Save Page Now feature was broken right up until the US election. Makes it easier to edit news articles on the fly without leaving a trail. Could be a motive for whoever was behind it, but I'm just speculating.
Yeah, totally makes sense, "they" attacked IA one month in advance before the elections, knowing that IA would spend around a month rewriting and improving their site code until the Save Page option would be enabled again (unless IA themselves are a part of the plot???), so that news articles could be "edited on the fly" (with what result?) until the election day, while other similar web archiving services such as archive.is would keep working just fine.
I'm clearly not... enough to understand what you're implying.
so... what's your point?