/r/Archiveteam

Photograph via snooOG

Archive Team is a loose collective of rogue archivists, programmers, writers and loudmouths dedicated to saving our digital heritage. Since 2009 this variant force of nature has caught wind of shutdowns, shutoffs, mergers, and plain old deletions - and done our best to save the history before it's lost forever.

Archive Team is a loose collective of rogue archivists, programmers, writers and loudmouths dedicated to saving our digital heritage. Since 2009 this variant force of nature has caught wind of shutdowns, shutoffs, mergers, and plain old deletions - and done our best to save the history before it's lost forever.



Related Subreddits


Feel free to join us on the IRC channel! We're on the hackint network in a channel called #archiveteam-bs, where we say truly awful things. Connect with your client of choice or use hackint's online chat.

/r/Archiveteam

14,817 Subscribers

6

Has Anyone Finished Archiving Veoh?

Their site shutdown was scheduled a month ago. Today is the last day with 16 hours left.

I notice they list their videos by categories for their entire site. So all we need to do is archive each category page.

Do you know how to automate the download process? For example with this:

https://veoh.com/find/piano?randText=yx8LsgGDVq3d&page=299

Automating the linkgrabbing and download with title author and upload date, then move on to the next video until page 1 is exhausted then the next page. Rinse and repeat until last page is reached.

Then plug each link into yt-dl.

Sad to say that I only found about this yesterday...

6 Comments
2024/11/10
13:53 UTC

4

Does Archiveteam's Archivebot safely rotate proxies/DNS addresses when it hits captchas when archiving a forum?

2 Comments
2024/11/09
15:21 UTC

9

Archiveteam and the IA

Does every page that Archivteam saves get put up on the Wayback Machine or does that have to manually be done?

6 Comments
2024/11/08
09:37 UTC

58

Manga Library Z, a website that distributed long out-of-print manga unavailable digitally elsewhere, is closing down on November 26.

https://closing.mangaz.com/

More info at https://www.reddit.com/r/manga/comments/1gk2nq6/manga_library_z_an_online_site_that_distributed/

Is there anyone who could work on a ripper and archive as much as possible of the site? There's a real danger that they could be lost media given most of the manga is not available legally or even illegally anywhere else in digital form. There have been attempts at rippers but the site uses an image scramble to combat those, so maybe some kind of program that could unscramble images would help? They have a library of over 4000 manga so it would undoubtedly be a major task, but it's a race against time.

11 Comments
2024/11/05
23:06 UTC

6

So like...what is this?

Like...this whole project has me so confused. How do we access the files that have been archived? I see large datasets hosted on archive.org, but how are we supposed to be able to search for anything, especially the archivebot-GO packs? Using archive.org's search function is practically awful as it is

5 Comments
2024/11/05
18:40 UTC

2

Staging server guide for beginners?

I have some storage and compute laying around and would like to contribute some as a staging server, as my warriors often seem to be bottlenecked at this end.

The only guide i found is this: https://wiki.archiveteam.org/index.php/Dev/Staging and i think it could be written a bit more comprehensive. is there a more comprehensive way to do this?

0 Comments
2024/11/04
17:25 UTC

3

Looking for a game that probably doesn't exist anymore.

For a long time now I've been trying to find a particular game:

Tl;dr It was called Starship and it was found via the Yahoo games list here:

https://imgur.com/KmfuXZJ

https://web.archive.org/web/19961129221717/http://www8.yahoo.com/Recreation/Games/Computer_Games/Titles/

Unfortunately the Archive link is broken and the game was gone before Internet Archive was a thing. I've looked pretty much everywhere, downloaded dozens of game collection ISOs hoping it was in one. no dice.

Since I'm back on the hunt I figured I should maybe ask here and see if anyone has a collection of particularly obscure games from the 90s that contains this game.

10 Comments
2024/11/02
05:10 UTC

6

Has anyone archived Manacled by Senlinyu?

Has anyone archived the entirety of Manacled by Senlinyu? It's going to be removed from AO3 at the end of the year and it's not all on the Web Archive (which still isn't working properly). Also, there needs to be a full archive of TwoSetViolin videos since yesterday as they got privated a couple weeks ago.

1 Comment
2024/11/02
02:43 UTC

23

forum.PCLab.pl, a massive polish IT forum operating since 2002, is shutting down on the end of November 2024

The PCLab forum, a polish community operating since 2002 and serving ~1.3 million posts, is shutting down on the 30th of November 2024.

Official statement [machine translated]:

Dear User,

Please be informed that in 30 days, i.e. November 30, 2024, the PC LAB Forum Website will be closed.

The Administrator of the PC LAB Forum Website - Ringier Axel Springer Polska sp. z o.o. with its registered office in Warsaw: will terminate all services of the PC LAB Forum Website with one month's notice.

The Administrator of the PC LAB Forum Service informs that:

As of November 29, 2024, all services of the PC LAB Forum Service will be terminated. The important reason justifying the termination is the closure of the PC LAB Forum Service.

[...]

After the announcement of the closure of the Forum Service from October 30, 2024, the creation of new accounts in the PC LAB Forum Service will not be possible.

With the closure of the PC LAB Forum Service, i.e. on November 29, 2024, the PC LAB Forum Content Directory will no longer be available. Until then, PC LAB Forum Users can access their content in the “Profile” tab, where they have the possibility to copy or archive it in the form of screenshots. [...]

Worth noting:

I really hope this could get archived,as there is a lot of IT history that will go down the drain with the site.

0 Comments
2024/10/30
19:12 UTC

5

Can the link to archive warrior program be updated

I noticed on http://warrior.archiveteam.org/ that the link to download the appliance goes to https://warriorhq.archiveteam.org/downloads/warrior3/

However, it seems the latest version is actually at

https://warriorhq.archiveteam.org/downloads/warrior4/

thanks

0 Comments
2024/10/29
21:06 UTC

11

Calorie Restriction Society (crsociety.org) forums went back up after a 3-month outage, but we don't know if they'll go down again for good

https://www.crsociety.org

https://www.crsociety.org/topic/18710-crsocietyorg-finally-got-back-online-after-4-months/#comment-48492

The domain owner died some time ago.
I'll try to find a way to scrape them with Winhttrack, but backup would be ideal. These forums aren't too large so they should take not too long to properly archive (there are some threads with 100+ replies and multiple pages that might require some extra nudging by the archive utilities)

1 Comment
2024/10/28
18:38 UTC

11

archiving - archives of highly important lost forums

hiii, there's a domain includes an arabic archived forums divided into threads. they are all so imoprtant on the web, and may be this domain won't survive online. so If anyone could help me for archiving some of them with Archivebot and give me a link to a local copy to preserve , I'd be so grateful . I need them WARCS to be played with replayweb.page desktop app on windows . for now these are the threads I want , https://al-maktaba.org/book/31616

this is the thread number 3. also https://al-maktaba.org/book/31617 number 5 . they're most valuable ones. for a list to all the forum links:

01- https://al-maktaba.org/book/31621

02- https://al-maktaba.org/book/31615

03- https://al-maktaba.org/book/31616

04- https://al-maktaba.org/book/31618

05- https://al-maktaba.org/book/31617

thank you for your hard work on this project, I appreciate that.

note: it was this forum on wayback : https://web.archive.org/web/20140422001403/http://ahlalhdeeth.com/vb/index.php

4 Comments
2024/10/28
03:33 UTC

11

Need help regarding downloading British Comics.

Hey everyone.

So, a bit of a situation going on in a website I usually visit every now and then...

https://britishcomics.wordpress.com/

On October 24th, 2024, Rebellion, who holds rights to many comics, has sent the site creator a DMCA order demanding him to remove all their comics from his British Comics blog, but the site creator realised it was too much to delete, so he will shut down the blog this coming Friday, November 1st, 2024.

Is there a way to download EVERYTHING remaining on the site at once? Some files there are exclusively found there and I don’t want to have to download each file at a time as it would be too time consuming.

Thanks. :)

0 Comments
2024/10/27
19:36 UTC

10

The Shane Dawson Archive Preservation Project

Hello there! So, I know Shane may be a bit of a touchy subject to do an archive preservation for, but growing up, like a lot of you, I actually used to enjoy his videos. Although they can easily be seen as offensive nowadays for obvious reasons, at the time, we didn't really know any better and thought his videos were hilarious. It was shock humor. He made jokes no one would ever dare make nowadays, again for COUNTLESS reasons. But it was a part of my childhood. I want to do my best to make an archive preservation for his work. From ShaneDawsonTV, his second channel (ShaneDawsonTV2, but now renamed to "Human Emoji" a placeholder for project he was gonna do but cancelled), Shane (his iPhone vlog channel before going through multiple different phases until it became what it is today), and his ShaneGlossin channel (now named Shane2), I grew up watching everything. All except the podcast series, which I'm also working on archiving since there was an audio version and a video version made exclusively for Fullscreen.

If you happened to have any videos saved from his channel, any help is always deeply appreciated! There's a lot of content that was either deleted or privated due to controversies, so hopefully there was dedicated fans out there like me who were lucky enough to save a good portion of stuff.

4 Comments
2024/10/22
00:53 UTC

0

HELP FINDING USA Today issue from December 19, 200

I can't find the original copy, does anyone have it? It's my school assignment t-t

6 Comments
2024/10/18
00:43 UTC

40

Accord's Library, a fan website dedicated to gathering and archiving all of Yoko Taro's work, is shutting down due to a Square Enix' C&D. Website's going down on October 31st.

1 Comment
2024/10/17
16:27 UTC

12

PSA: The video sharing website Veoh announced it will shut down soon. You might want to grab videos from there before they are gone.

As the title says, Veoh is shutting down soon per an announcement at the top of the webpage. https://www.veoh.com/ You may want to save videos from there before they are gone.

6 Comments
2024/10/17
15:19 UTC

7

My warrior is perpetually rate limited at basically everything

Is it cause of my settings? It happened with nhentai, url team 2, blogger and telegram. It has just kept retrying endlessly since last week and I haven't seen it download much since. I closed it and restarted, and messed around with the number of concurrent items and resync threads but it has same issue even on 1

1 Comment
2024/10/13
21:42 UTC

2

Vampirefreaks profile archive

Hi, is there a archive of vampirefreaks profiles? I'm looking for a profile in particular but I have no idea of where to look.

0 Comments
2024/10/12
18:21 UTC

7

Are there folks in community planning to archive video game files from AusGamers?

Has anyone tried to backup all stuff from the Files section of AusGamers?

0 Comments
2024/10/09
07:09 UTC

5

My own personal archive + A.I.

Have you tried archiving your own data and training AI on it?

I have a lot of data (texts, photos, videos) that I can't control because I find them on my drives, on my social media channels, etc. I could collect it all in one place by selecting the content that I consider valuable, but sorting it out by people who were there, events and places is a gigantic task that will take at least 40 hours.

Have you tried using AI in such tasks?

What I would like to do:

  • arrange the photos
  • download my data from Google and Facebook and, based on that, draw ideas and conclusions from the conversations I had
  • arrange the texts I had according to my catalogues.
2 Comments
2024/10/08
17:02 UTC

1

Noob at archive searching

I wanna look for videos uploaded by a channel called "Cojum Dip" on Google Video and Yahoo Vídeo but I don't know where I can easily search for which archive has it

Can anybody help me??

2 Comments
2024/10/08
14:26 UTC

3

Searching for a deleted/hidden youtube video but can't find it on filmot

I've been trying to find a youtube video now set to private that I've watched a few months ago.

Unfortunately I can't find it on filmot because it's been almost a year since filmot isn't grabbing new videos released on that channel.

This video definitely has subtitles and I was hoping to at least get them if the video content is now gone forever.

Does anyone know if there are any other places that might still have this video?

Thank you in advance!

4 Comments
2024/10/05
18:12 UTC

2

How can I run archive team warrior automatically on startup?

this would be really convenient. I turn it on everytime I get on the computer

2 Comments
2024/10/03
17:19 UTC

15

The Datpiff archive is now completely gone.

1 Comment
2024/10/01
02:16 UTC

Back To Top