I post about archiving and such.
- 21 Posts
- 9 Comments
archivist@lemm.eeOPto Data Hoarder@lemmy.world•Dailymotion will start deleting 12-months-unwatched videos starting soon3·5 days agoArchiveTeam are “planning” to save everything(?), but they should have started like two months ago, if they have any hope of doing that. Dunno if it’s being worked on at all. Plus committing to a multiple petabyte project I assume takes some doing.
I’ve scarcely visited the site myself, but I looked around for stuff of interest to me and bagged them. yt-dlp works just fine!
archivist@lemm.eeOPMto Internet Archive@lemm.ee•Don't You Believe It (Stop Motion Short Film ~1940)1·23 days agoIt’s odd that while there is one part that’s film negative on purpose, some just seem to be negative for no reason…
After a while, blog posts started returning a 403 error, then later 410. Images and javascript files remained downloadable for longer, but the JS files started returning 410 after a while as well. Now, only images are available, and the known ones are slowly being archived as long as they are downloadable.
archivist@lemm.eeto Québec@lemmy.ca•Gaspé 1969 - Half an hour of driving through eastern Quebec6·25 days agoWasn’t sure where to cross-post it on .ca! Québec, duh! Thanks.
Old “mundane” footage like this is always interesting, I would say!
There does seem to be some tracker rate limiting, but there certainly is a lot of work to be done.
It’s very convenient to have these archives always a click of a button away. Definitely recommend!
archivist@lemm.eeOPMto Internet Archive@lemm.ee•IA is currently experiencing a power outage1·29 days agoThey are back now, but I could find no further info about it.
archivist@lemm.eeOPMto Internet Archive@lemm.ee•IA is currently experiencing a power outage1·30 days agoFor a second I thought it might be another wave of DOS.
In case you haven’t looked into it yourself yet…
ArchiveTeam are independent from IA, but their stuff mostly does end up uploaded into the Wayback Machine. Storage space (like yours) isn’t usually what they are looking for, but rather the internet bandwidth and “virgin” IP address of aforementioned “warriors” running their code to scrape different websites, and then uploading the results to AT’s servers, where they are collected and eventually uploaded again to IA.
Check out https://tracker.archiveteam.org/ for current projects