00:00:07<Webuser717640>good day, would like to ask how to fix blank current project and no available projects.. was fine the last time i opened it, and already restarted vm and vm software
00:00:24<Webuser717640>nvm just magically fixed now and projects are showing
00:01:27<Webuser201884>Any idea why telegram project might be giving me an error about unable to submit discovered urls
00:01:59etnguyen03 quits [Client Quit]
00:02:38<TheTechRobo>Webuser201884: Is it still happening?
00:03:01<@imer>KoleiTheBat: Webuser201884: Webuser717640: tracker was down for a bit, looks like it's coming back now?
00:03:02<TheTechRobo>Looks like everything came back up a few minutes ago.
00:04:08<Webuser201884>Nope, just restarted it seems ok now
00:04:12<Webuser201884>its flying
00:09:04<nyakase>im getting the unable to submit discovered urls, one sec, will send a snip
00:09:22<nyakase>https://hakase.nekoweb.org/scraps/urlsfail.txt
00:09:38<nyakase>still seeing those get logged
00:11:30Nact quits [Quit: Konversation terminated!]
00:11:48<Webuser201884>yeah me too, it worked for a bit then started doing it again
00:12:08<nyakase>suspect what you saw is the downloading working properly, but its failing to upload here
00:12:16<Webuser201884>yep
00:12:46<nyakase>it does look like they eventually get to "queued for deduplication" but its real slow
00:12:50<Webuser201884>its "deduplicating" them now apparently
00:13:03<Webuser201884>lol the timing
00:14:17<Webuser201884>does yours also end in urls.wantreadnil
00:14:37<nyakase>yes
00:14:58<Webuser201884>suppose it's nothing im doing wrong? just kinda the process at this time lol
00:15:03<nyakase>yes
00:15:20<Webuser201884>its doing it with Urls as well
00:15:24<nyakase>for context: the urls-grab project partly runs on urls discovered during scraping of other sites, but that seems to be failing at the moment
00:15:31<nyakase>(feeding the discovered urls to it)
00:15:40<Webuser201884>ohhh okay
00:16:09<Webuser201884>Seems like they slowed the traffic with glitch down by my guess, im getting rate limited way harder
00:16:35<nyakase>glitch is stopped at the moment
00:16:44<Webuser201884>ohhh well crap okay hah
00:16:47<nyakase>the "tracker rate limiting" is an archiveteam imposed one
00:17:14<Webuser201884>https://wiki.archiveteam.org/index.php/Projects#Warrior_projects is this the right site i wanna refer to
00:18:00<nyakase>the "warrior-based projects" section under the "current projects" section, not the "warrior projects" section, but yes
00:18:20<Webuser201884>okay
00:18:32<@JAA>(Also appears on the Main Page.)
00:20:23IDK quits [Quit: Connection closed for inactivity]
00:28:31dabs joins
00:33:19Webuser717640 quits [Client Quit]
00:33:23nine joins
00:33:23nine quits [Changing host]
00:33:23nine (nine) joins
00:35:47<Webuser201884>also one last question
00:36:04<Webuser201884>I remember reading that VPNs arent allowed correct? What if i have a dedicated IP
00:36:10<Webuser201884>its with torguard
00:38:37<Webuser201884>im not using it atm, but if i can i probably will. But because im unsure im not for now
00:44:34Webuser211389 joins
00:45:23Webuser211389 leaves
00:45:54Webuser229112 joins
00:47:48<Webuser229112>Hello, I was looking through ArchiveTeam Warrior FAQ, and I noticed that they want "clean" connections. They list a bunch of things of what not to use or do, and one of them mentions "no connections that intercept DNS, an example being ISP's". My ISP is Shaw Communications, and so I was wondering if anyone knows if im able to run this project?
00:48:36<nyakase>so thats what you meant by shaw :) i thought it was slang..
00:49:16<nyakase>(dont actually know the answer, please hold for a while)
00:49:28<Webuser229112>oks
00:53:41rvtr quits [Client Quit]
00:57:27<pokechu22>Webuser229112: If you go to https://nonexistentsubdomain.archiveteam.org, do you get a browser error page, or redirected to something else?
00:59:27Island joins
01:06:07Webuser229112 quits [Client Quit]
01:10:28etnguyen03 (etnguyen03) joins
01:10:53Webuser064333 joins
01:11:13<Webuser064333>hello, i previously asked a question regarding archive team and shaw communications dns stuff (mom called me to do something)
01:11:30<Webuser064333>someone asked if the link that they sent me errors out or is redirected
01:11:44<Webuser064333>and i clicked on the link and it says site cant be reached
01:11:53<nyakase>pokechu22 ^
01:12:47<pokechu22>If that's the case I think you're safe on the DNS side, probably
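A rough sketch of the check discussed above, assuming Python and its standard `socket` module: resolve a name that should not exist and see whether an address comes back anyway, which would hint that the resolver rewrites failed lookups. The function name `looks_hijacked` and its injectable `resolve` parameter are made up for illustration; this is not the Warrior's own check.

```python
import socket


def looks_hijacked(hostname, resolve=socket.gethostbyname):
    """Return True if a name that should not exist resolves anyway.

    `resolve` is injectable (an assumption, so the logic can be tried
    offline); by default the system resolver is used.
    """
    try:
        resolve(hostname)
    except OSError:
        # socket.gaierror is a subclass of OSError: the lookup failed,
        # which is the expected outcome for a nonexistent name.
        return False
    # An address came back for a name that should not exist.
    return True
```

Calling `looks_hijacked("nonexistentsubdomain.archiveteam.org")` repeats the browser test from the command line. A failed lookup can also just mean there is no network, so a `False` result is only meaningful while other names resolve normally.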
01:14:27<Webuser064333>cool! sounds good
01:14:40<Webuser064333>ill start setting up ArchiveTeam Warrior
01:41:04etnguyen03 quits [Client Quit]
01:45:31etnguyen03 (etnguyen03) joins
01:49:49<h2ibot>Pokechu22 created E621 (+2570, document current status): https://wiki.archiveteam.org/?title=E621
01:53:58<nicolas17>*some* DNS problems are detected by the warrior code before it starts
01:54:20<nicolas17>for example there's projects where we force using 9.9.9.9 as DNS
01:54:39<nicolas17>some ISPs intercept those requests and use their own DNS server anyway
01:54:50<h2ibot>Pokechu22 edited E621 (+156, mention long description): https://wiki.archiveteam.org/?diff=56380&oldid=56379
01:54:51<nicolas17>and we detect that and throw an error
01:55:31<nicolas17>huh
01:55:34<nicolas17>pokechu22++
01:55:35<eggdrop>[karma] 'pokechu22' now has 175 karma!
01:55:41<nicolas17>the E621 page didn't even exist yet?
01:55:45<pokechu22>Nope
01:56:04<pokechu22>I figured it's probably worth documenting how it was saved before since there were several weird aspects to it
01:56:16<nicolas17>document all the things
01:56:31<nicolas17>I was confused for a second why the diff link didn't load a diff :D
02:11:36Wohlstand quits [Quit: Wohlstand]
02:12:49TheEnbyperor_ quits [Ping timeout: 260 seconds]
02:13:02TheEnbyperor quits [Ping timeout: 276 seconds]
02:14:21corentin quits [Ping timeout: 276 seconds]
02:14:52<KoleiTheBat>anyone know how long facebook will have me banned from scraping their ads?
02:15:25corentin joins
02:15:32<KoleiTheBat>never thought i would ask when i will be unbanned from viewing ads XD
02:17:38<nstrom|m>waiting a couple days seems to fix it, not quite sure exactly
02:19:00dabs quits [Read error: Connection reset by peer]
02:22:25etnguyen03 quits [Client Quit]
02:26:02HP_Archivist (HP_Archivist) joins
02:29:03etnguyen03 (etnguyen03) joins
02:37:43TheEnbyperor joins
02:41:58TheEnbyperor_ (TheEnbyperor) joins
02:44:32lemuria quits [Read error: Connection reset by peer]
02:44:54lemuria (lemuria) joins
02:50:14<Webuser201884>Hey nyakase could you tell me if my ISP Telus in British Columbia would be of any problems?
02:50:20<Webuser201884>I dont use their dns'
02:50:44<Webuser201884>I also have a fixed ip and a 3g link
02:51:43etnguyen03 quits [Remote host closed the connection]
03:02:17<Webuser201884>also what would the image address be for URLTeam2? Im a little lost as to what it is lol
03:02:40<Webuser201884>or is that not one i can work on in a warrior container
03:07:05Webuser064333 quits [Client Quit]
03:32:09cuphead2527480 quits [Quit: Connection closed for inactivity]
03:49:15<TheTechRobo>Webuser201884: Should be atdr.meo.ws/archiveteam/terroroftinytown-client-grab
04:21:27Webuser201884 quits [Quit: Ooops, wrong browser tab.]
04:29:26nine quits [Quit: See ya!]
04:29:39nine joins
04:29:39nine quits [Changing host]
04:29:39nine (nine) joins
04:31:17gosc joins
04:41:34beardicus quits [Ping timeout: 260 seconds]
04:54:50flotwig_ joins
04:56:11flotwig quits [Ping timeout: 276 seconds]
04:56:12flotwig_ is now known as flotwig
04:58:04sec^nd quits [Remote host closed the connection]
04:58:21sec^nd (second) joins
05:39:18Makusu (Makusu) joins
05:59:45Hackerpcs quits [Quit: Hackerpcs]
06:06:07beardicus (beardicus) joins
06:06:17Hackerpcs (Hackerpcs) joins
06:15:20nine quits [Client Quit]
06:15:32nine joins
06:15:34nine quits [Changing host]
06:15:34nine (nine) joins
06:19:02beardicus quits [Read error: Connection reset by peer]
06:19:19beardicus (beardicus) joins
06:22:28awauwa (awauwa) joins
06:31:05lennier2 joins
06:34:09lennier1 quits [Ping timeout: 260 seconds]
07:24:09<Vokun>What's the history behind why that project is so different compared to everything else? Is it more a side project like wikiteam or burnthetwitch used to be?
07:45:32Guest58 joins
07:46:29beardicus quits [Ping timeout: 260 seconds]
07:46:53Guest58 quits [Client Quit]
07:49:08beardicus (beardicus) joins
08:18:34Guest58 joins
08:31:00IDK (IDK) joins
08:50:39Guest58 quits [Ping timeout: 260 seconds]
09:01:34Dada joins
09:02:01Guest58 joins
09:02:16<kpcyrd>arkiver: web1: you run a webserver for your content, web2: somebody else runs a webserver for your content, web3: you use content-addressing identifier for your content, and things are bound to a cryptographic key instead of a domain+ip
09:03:00<kpcyrd>(people who either a) do web1/web2 for living or b) consider content to be capital - don't like that)
09:05:19<kpcyrd>"web3" is usually easier to archive, web1/web2 is "I want to insist you have to talk to my computer"
09:06:57<kpcyrd>(then get upset if a poorly programmed AI scraper does that)
09:07:47<kpcyrd>it's a problem we wouldn't be having with a "mutable torrents" style web
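A minimal sketch of the content-addressing idea described above, assuming Python and SHA-256 as the hash: the identifier is derived from the bytes themselves, so any copy from any host can be verified against it. `content_id` and `ContentStore` are hypothetical names for illustration and do not correspond to any particular "web3" system.

```python
import hashlib


def content_id(data: bytes) -> str:
    """Derive an identifier from the bytes themselves (SHA-256 hex digest)."""
    return hashlib.sha256(data).hexdigest()


class ContentStore:
    """Toy in-memory store keyed by content hash (hypothetical)."""

    def __init__(self):
        self._blobs = {}

    def put(self, data: bytes) -> str:
        # The key depends only on the content, not on who stores it.
        cid = content_id(data)
        self._blobs[cid] = data
        return cid

    def get(self, cid: str) -> bytes:
        data = self._blobs[cid]
        # Whoever serves the bytes, the reader can re-hash and verify them.
        if content_id(data) != cid:
            raise ValueError("content does not match its identifier")
        return data
```

Because the identifier does not name a server, an archived copy answers to the same identifier as the original, which is the property that makes this style easier to archive.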
09:21:00<@arkiver>kpcyrd: thanks!
09:35:27<kpcyrd>you're welcome!
09:38:10Island quits [Read error: Connection reset by peer]
09:43:03<BlankEclair>oh huh
09:43:16<BlankEclair>i interpret web1 as serving static files on a computer, and web2 as generating html on the fly
09:53:35nepeat quits [Quit: ZNC - https://znc.in]
09:57:14nepeat (nepeat) joins
09:59:35Guest58 quits [Client Quit]
10:00:24Guest58 joins
10:01:56UwU_93bydbco451y joins
10:30:37Guest58 quits [Client Quit]
10:31:50Guest58 joins
10:36:49Guest58 quits [Ping timeout: 260 seconds]
11:00:03Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat]
11:02:49Bleo182600722719623455222 joins
11:03:57nine quits [Quit: See ya!]
11:04:10nine joins
11:04:10nine quits [Changing host]
11:04:10nine (nine) joins
11:09:29LddPotato quits [Ping timeout: 260 seconds]
11:18:01<@JAA>Vokun: I wasn't around at the time, but there are a couple technical reasons that meant the regular tracker wasn't a good fit for it: rate limiting per shortener was impossible, there was no mechanism for regular/continuous queueing, and all completed items were kept in memory. There are partial solutions for these nowadays, though the rate limiting doesn't scale well.
11:22:49Guest58 joins
11:32:52LddPotato (LddPotato) joins
11:43:13cuphead2527480 (Cuphead2527480) joins
11:48:51riteo (riteo) joins
12:19:06Snivy quits [Read error: Connection reset by peer]
12:31:46Snivy (Snivy) joins
12:54:12rohvani quits [Read error: Connection reset by peer]
13:01:13@arkiver quits [Remote host closed the connection]
13:02:03arkiver (arkiver) joins
13:02:03@ChanServ sets mode: +o arkiver
13:09:22lemuria quits [Read error: Connection reset by peer]
13:10:22lemuria (lemuria) joins
13:13:23@arkiver quits [Remote host closed the connection]
13:13:40arkiver (arkiver) joins
13:13:40@ChanServ sets mode: +o arkiver
13:17:30oldmurray34 joins
13:19:29oldmurray34 quits [Client Quit]
13:29:09@arkiver quits [Remote host closed the connection]
13:30:35oldmurray34 joins
13:30:38arkiver (arkiver) joins
13:30:38@ChanServ sets mode: +o arkiver
13:31:41oldmurray34 quits [Client Quit]
13:31:46oldmurray34 joins
13:34:38oldmurray34 quits [Client Quit]
13:41:19@arkiver quits [Remote host closed the connection]
13:42:37arkiver (arkiver) joins
13:42:37@ChanServ sets mode: +o arkiver
13:49:00@arkiver quits [Remote host closed the connection]
13:50:05arkiver (arkiver) joins
13:50:05@ChanServ sets mode: +o arkiver
13:52:35cuphead2527480 quits [Client Quit]
13:54:34APOLLO03 quits [Ping timeout: 260 seconds]
13:55:09APOLLO03 joins
13:58:10@arkiver quits [Remote host closed the connection]
13:58:25arkiver (arkiver) joins
13:58:25@ChanServ sets mode: +o arkiver
14:03:35that_lurker quits [Remote host closed the connection]
14:03:39that_lurker (that_lurker) joins
14:24:54that_lurker quits [Ping timeout: 260 seconds]
14:25:03that_lurker (that_lurker) joins
14:33:03UwU_93bydbco451y quits [Quit: Dats about it, see ya.]
14:46:33ThetaDev quits [Read error: Connection reset by peer]
14:46:55ThetaDev joins
14:53:57gosc quits [Quit: Leaving]
15:04:37Snivy quits [Client Quit]
15:19:47Snivy (Snivy) joins
15:45:13flotwig_ joins
15:46:11flotwig quits [Ping timeout: 276 seconds]
15:46:12flotwig_ is now known as flotwig
15:52:09Nact joins
15:55:35NF885 (NF885) joins
15:56:22<h2ibot>Hans5958 edited CurrentWarriorProject (-4): https://wiki.archiveteam.org/?diff=56381&oldid=54152
16:01:23<h2ibot>Hans5958 edited Glitch (+2378): https://wiki.archiveteam.org/?diff=56382&oldid=56357
16:02:46KoleiTheBat quits [Quit: Ooops, wrong browser tab.]
16:03:23<h2ibot>Hans5958 edited Glitch (+71): https://wiki.archiveteam.org/?diff=56383&oldid=56382
16:03:49UwU_93bydbco451y joins
16:33:17Nact quits [Remote host closed the connection]
16:38:26grill (grill) joins
16:41:08sludge joins
16:47:49ducky quits [Ping timeout: 260 seconds]
16:48:30<h2ibot>Cooljeanius edited Glitch (+8, use URL template): https://wiki.archiveteam.org/?diff=56384&oldid=56383
16:49:30<h2ibot>Cooljeanius edited Glitch (+33, add separate section for references): https://wiki.archiveteam.org/?diff=56385&oldid=56384
16:50:27<egallager>re: my last edit to the "Glitch" article: how do I get the "References" section to go below the mascot picture?
16:53:27ducky (ducky) joins
16:53:43<TheTechRobo>Looks like replacing 'left' with 'none' will do the trick.
16:56:43Nact joins
16:59:26lennier2_ joins
16:59:50Nact quits [Client Quit]
17:00:02Nact joins
17:02:24lennier2 quits [Ping timeout: 260 seconds]
17:05:05PredatorIWD25 quits [Read error: Connection reset by peer]
17:07:10<egallager>TheTechRobo: thanks, that seems to have worked
17:07:33<h2ibot>Cooljeanius edited Glitch (+0, fix image placement): https://wiki.archiveteam.org/?diff=56386&oldid=56385
17:10:20PredatorIWD25 joins
17:21:49pixel leaves [Error from remote client]
17:26:56flotwig quits [Ping timeout: 276 seconds]
17:28:56beastbg8_ joins
17:32:09beastbg8 quits [Ping timeout: 260 seconds]
17:32:58flotwig joins
17:40:38<h2ibot>Hans5958 moved CurrentWarriorProject to Main Page/Current Warrior Project (Only used on Main Page, similar structure on…): https://wiki.archiveteam.org/?title=Main%20Page/Current%20Warrior%20Project
17:50:03awauwa quits [Quit: awauwa]
17:50:23UwU_93bydbco451y quits [Read error: Connection reset by peer]
17:50:29UwU_93bydbco451y joins
17:55:40<h2ibot>Pokechu22 created E926 (+41, Redirected page to [[E621]]): https://wiki.archiveteam.org/?title=E926
17:56:28arch_ (arch) joins
17:56:40<h2ibot>Pokechu22 edited E621 (+39, comments on posts also not in the DB dumps): https://wiki.archiveteam.org/?diff=56390&oldid=56380
17:56:47arch quits [Remote host closed the connection]
17:56:47arch_ is now known as arch
17:59:30Wohlstand (Wohlstand) joins
18:06:43<h2ibot>Pokechu22 uploaded File:E621 logo.png ({{DISPLAYTITLE|File:e621 logo.png}} Logo of…): https://wiki.archiveteam.org/?title=File%3AE621%20logo.png
18:06:44<h2ibot>Pokechu22 edited File:E621 logo.png (+0): https://wiki.archiveteam.org/?diff=56392&oldid=56391
18:07:15driib9 quits [Quit: The Lounge - https://thelounge.chat]
18:07:43<h2ibot>Pokechu22 edited File:E621 logo.png (+0): https://wiki.archiveteam.org/?diff=56393&oldid=56392
18:08:43<h2ibot>Pokechu22 uploaded File:E926 screenshot.png ({{DISPLAYTITLE:File:e926…): https://wiki.archiveteam.org/?title=File%3AE926%20screenshot.png
18:08:44<h2ibot>Pokechu22 edited E621 (+94, screenshot (safe posts) + logo): https://wiki.archiveteam.org/?diff=56395&oldid=56390
18:09:49driib9 (driib) joins
18:15:41grill quits [Ping timeout: 276 seconds]
18:22:42dabs joins
18:28:16arch_ (arch) joins
18:28:22arch quits [Remote host closed the connection]
18:28:27arch_ is now known as arch
18:35:30UwU_93bydbco451y quits [Remote host closed the connection]
18:40:28BornOn420 quits [Remote host closed the connection]
18:41:09BornOn420 (BornOn420) joins
18:55:49Lord_Nightmare quits [Quit: ZNC - http://znc.in]
18:58:26Lord_Nightmare (Lord_Nightmare) joins
19:01:53<h2ibot>TheTechRobo edited Twitch (-79, /* By TheTechRobo (#burnthetwitch) */ past…): https://wiki.archiveteam.org/?diff=56396&oldid=56296
19:03:08APOLLO03 quits [Ping timeout: 276 seconds]
19:04:55Lord_Nightmare quits [Client Quit]
19:07:56xatixatix3 joins
19:08:24Lord_Nightmare (Lord_Nightmare) joins
19:09:24Larsenv quits [Quit: The Lounge - https://thelounge.chat]
19:09:59Larsenv (Larsenv) joins
19:10:17<nulldata>Looks like the Microsoft Answers site might be going away fully. A few weeks ago it was just the Xbox section, now other sections are also being closed. https://answers.microsoft.com/en-us <- AB is unable to crawl due to JS
19:11:18Larsenv quits [Client Quit]
19:12:23Larsenv (Larsenv) joins
19:13:05Larsenv quits [Client Quit]
19:13:54Larsenv (Larsenv) joins
19:15:04Larsenv quits [Client Quit]
19:15:32Larsenv (Larsenv) joins
19:16:49xatixatix3 quits [Client Quit]
19:23:10kansei (kansei) joins
19:24:45<masterx244|m>JS--
19:24:46<eggdrop>[karma] 'JS' now has -2 karma!
19:26:32dabs quits [Read error: Connection reset by peer]
19:33:03<h2ibot>Nintendofan885 edited Main Page/Current Warrior Project (+4, back to Telegram): https://wiki.archiveteam.org/?diff=56397&oldid=56387
19:43:18<nulldata>I've got a script incrementing through months and pagination, grabbing topic links for Microsoft Answers.
19:43:49<nulldata>Will throw them in AB once finished
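A rough sketch of "incrementing through months and pagination", assuming Python. The actual script and the real Microsoft Answers URL scheme are not shown in the log, so `template`, `pages_per_month`, and both function names are placeholders for illustration only.

```python
from datetime import date


def month_starts(first: date, last: date):
    """Yield the first day of each month from `first` to `last`, inclusive."""
    year, month = first.year, first.month
    while (year, month) <= (last.year, last.month):
        yield date(year, month, 1)
        month += 1
        if month > 12:
            year, month = year + 1, 1


def listing_urls(template, first, last, pages_per_month):
    """Build one listing URL per (month, page) pair.

    `template` and `pages_per_month` are placeholders; a real script would
    more likely keep paginating until a page comes back empty.
    """
    for start in month_starts(first, last):
        for page in range(1, pages_per_month + 1):
            yield template.format(year=start.year, month=start.month, page=page)
```

Each generated listing page would then be fetched and its topic links collected into a list for ArchiveBot.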
19:46:31FiTheArchiver joins
19:58:39Bob joins
20:04:09<h2ibot>Nintendofan885 created Category:ArchiveBot project (+59, create wanted category): https://wiki.archiveteam.org/?title=Category%3AArchiveBot%20project
20:06:09<h2ibot>Nintendofan885 edited File:Splinder homepage.png (-1, moved to files): https://wiki.archiveteam.org/?diff=56399&oldid=6568
20:21:17kansei quits [Client Quit]
20:27:57kansei (kansei) joins
20:29:56Bob leaves
20:31:16Island joins
20:34:13<h2ibot>Nintendofan885 edited AccuWeather (+10, URL template): https://wiki.archiveteam.org/?diff=56400&oldid=47546
21:01:39APOLLO03 joins
21:11:05arcane joins
21:13:06<arcane>hey yall
21:13:28<arcane>is there some sort of ranked ladder i can check to see who's the most contributest of them all?
21:14:23<nstrom|m>not across projects, afaik
21:14:56<arcane>hmm ok, how about per project? Where do I see that?
21:15:14<nstrom|m>https://tracker.archiveteam.org/ and click on a project at the bottom
21:15:26<arcane>cheers!
21:16:04<nstrom|m>at least for currently active projects. historical ones are still on there but not linked so you'd need to know the name
21:17:12<DigitalDragons>for past projects, the wiki page should link to the tracker page
21:17:39arcane quits [Client Quit]
21:37:34etnguyen03 (etnguyen03) joins
22:06:06dabs joins
22:07:36egallager quits [Quit: This computer has gone to sleep]
22:10:06NF885 quits [Quit: Ooops, wrong browser tab.]
22:20:17LddPotato_ joins
22:22:32Wohlstand quits [Quit: Wohlstand]
22:23:14LddPotato quits [Ping timeout: 260 seconds]
22:23:14LddPotato_ is now known as LddPotato
22:57:52FiTheArchiver quits [Read error: Connection reset by peer]
22:58:42dabs quits [Read error: Connection reset by peer]
23:04:25lemuria quits [Read error: Connection reset by peer]
23:05:17beastbg8__ joins
23:05:23lemuria (lemuria) joins
23:07:54Dada quits [Remote host closed the connection]
23:08:11beastbg8_ quits [Ping timeout: 276 seconds]
23:24:48etnguyen03 quits [Client Quit]
23:25:09etnguyen03 (etnguyen03) joins
23:27:40<hexagonwin>it seems like my choimobile.vn forum crawling finished without any visible issue, should i just upload this to archive.org? would be great to make it accessible and having 20+gb of warcs doesn't seem that good..
23:34:54etnguyen03 quits [Client Quit]
23:59:02etnguyen03 (etnguyen03) joins