| 00:04:45 | | kansei- quits [Ping timeout: 268 seconds] |
| 00:07:19 | | kansei (kansei) joins |
| 00:26:50 | | kansei- (kansei) joins |
| 00:28:11 | | kansei quits [Ping timeout: 268 seconds] |
| 00:33:02 | | etnguyen03 (etnguyen03) joins |
| 00:40:49 | | evergreen56 is now known as evergreen |
| 00:50:39 | | BitByBit4 (BitByBit) joins |
| 00:55:56 | | dabs quits [Read error: Connection reset by peer] |
| 01:02:40 | | useretail_ joins |
| 01:05:48 | | useretail__ quits [Ping timeout: 268 seconds] |
| 01:26:09 | | nine quits [Ping timeout: 268 seconds] |
| 01:29:05 | | nine joins |
| 01:38:23 | | polypept1 (polypeptide) joins |
| 01:41:00 | | sg-72 quits [Quit: Leaving] |
| 01:41:44 | | polypeptide quits [Ping timeout: 260 seconds] |
| 01:44:35 | | sg72 joins |
| 01:49:30 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 01:49:35 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 01:54:12 | | TheEnbyperor (TheEnbyperor) joins |
| 01:54:17 | | TheEnbyperor_ joins |
| 01:57:25 | | nine quits [Client Quit] |
| 01:57:43 | | nine joins |
| 02:02:53 | | TheEnbyperor_ quits [Remote host closed the connection] |
| 02:02:53 | | TheEnbyperor quits [Read error: Connection reset by peer] |
| 02:03:04 | | Arcorann (Arcorann) joins |
| 02:13:50 | | TheEnbyperor joins |
| 02:15:19 | | TheEnbyperor_ (TheEnbyperor) joins |
| 02:43:46 | | grill quits [Ping timeout: 268 seconds] |
| 02:44:20 | | grill (grill) joins |
| 03:06:10 | | etnguyen03 quits [Client Quit] |
| 03:07:33 | | etnguyen03 (etnguyen03) joins |
| 03:12:13 | | etnguyen03 quits [Remote host closed the connection] |
| 04:04:38 | | n9nes quits [Ping timeout: 268 seconds] |
| 04:05:37 | | n9nes joins |
| 04:39:48 | | ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
| 04:40:35 | | ThetaDev joins |
| 05:15:18 | | devkev05 joins |
| 05:17:19 | | devkev0 quits [Ping timeout: 268 seconds] |
| 05:17:19 | | devkev05 is now known as devkev0 |
| 05:22:29 | | nexussfan quits [Quit: Konversation terminated!] |
| 05:22:42 | | helpimbashful (helpimbashful) joins |
| 05:24:00 | | multisn8 (multisn8) joins |
| 05:53:42 | | multisn8 quits [Ping timeout: 268 seconds] |
| 06:10:32 | | multisn8 (multisn8) joins |
| 06:11:39 | | polypept1 quits [*.net *.split] |
| 06:11:39 | | SootBector quits [*.net *.split] |
| 06:11:39 | | retrograde quits [*.net *.split] |
| 06:17:50 | | retrograde (retrograde) joins |
| 06:24:32 | | wickedplayer494 quits [Ping timeout: 268 seconds] |
| 06:31:06 | | polypeptide (polypeptide) joins |
| 06:33:09 | | SootBector (SootBector) joins |
| 06:36:49 | | IceCodeNew|m quits [*.net *.split] |
| 06:36:49 | | rain|m quits [*.net *.split] |
| 06:36:49 | | hillow596|m quits [*.net *.split] |
| 06:36:49 | | Misty|m quits [*.net *.split] |
| 06:36:49 | | will|m quits [*.net *.split] |
| 06:36:49 | | wwww|m quits [*.net *.split] |
| 06:36:49 | | saouroun|m quits [*.net *.split] |
| 06:36:49 | | noxious quits [*.net *.split] |
| 06:36:49 | | Passiing|m quits [*.net *.split] |
| 06:36:49 | | miksters|m quits [*.net *.split] |
| 06:36:49 | | yarnover|m quits [*.net *.split] |
| 06:36:49 | | Claire|m quits [*.net *.split] |
| 06:36:49 | | ampdot|m quits [*.net *.split] |
| 06:36:49 | | yetanotherarchiver|m quits [*.net *.split] |
| 06:36:49 | | username675f|m quits [*.net *.split] |
| 06:36:49 | | gareth48|m quits [*.net *.split] |
| 06:36:49 | | mat|m1 quits [*.net *.split] |
| 06:36:49 | | Tyrasuki|m quits [*.net *.split] |
| 06:36:49 | | Valkum|m quits [*.net *.split] |
| 06:36:49 | | Joy|m quits [*.net *.split] |
| 06:36:49 | | Cronfox|m quits [*.net *.split] |
| 06:36:49 | | kaz__|m quits [*.net *.split] |
| 06:36:49 | | ram|m quits [*.net *.split] |
| 06:36:49 | | PhoHale|m quits [*.net *.split] |
| 06:36:49 | | mister_x quits [*.net *.split] |
| 06:36:49 | | triplecamera|m quits [*.net *.split] |
| 06:36:49 | | katia|m quits [*.net *.split] |
| 06:36:49 | | bogsen quits [*.net *.split] |
| 06:36:49 | | vics quits [*.net *.split] |
| 06:36:49 | | Fijxu|m quits [*.net *.split] |
| 06:36:49 | | osiride|m quits [*.net *.split] |
| 06:36:49 | | Adamvoltagex|m quits [*.net *.split] |
| 06:36:49 | | v1cs quits [*.net *.split] |
| 06:36:49 | | its_notjack quits [*.net *.split] |
| 06:36:49 | | trumad|m quits [*.net *.split] |
| 06:36:49 | | NickS|m quits [*.net *.split] |
| 06:36:49 | | starg2|m quits [*.net *.split] |
| 06:36:49 | | ax|m quits [*.net *.split] |
| 06:36:49 | | spearcat|m quits [*.net *.split] |
| 06:36:49 | | supermariofan67|m quits [*.net *.split] |
| 06:36:49 | | haha-whered-it-go|m quits [*.net *.split] |
| 06:36:49 | | nightpool quits [*.net *.split] |
| 06:36:49 | | joepie91|m quits [*.net *.split] |
| 06:36:49 | | nosamu|m quits [*.net *.split] |
| 06:36:49 | | th3z0l4|m quits [*.net *.split] |
| 06:36:49 | | Nulo|m quits [*.net *.split] |
| 06:36:49 | | ragu|m quits [*.net *.split] |
| 06:36:49 | | nano412510 quits [*.net *.split] |
| 06:36:49 | | mikolaj|m quits [*.net *.split] |
| 06:36:49 | | GhostIsBeHere|m quits [*.net *.split] |
| 06:36:49 | | that_lurker|m quits [*.net *.split] |
| 06:36:49 | | l0rd_enki|m quits [*.net *.split] |
| 06:36:49 | | EvanBoehs|m quits [*.net *.split] |
| 06:36:50 | | jevinskie quits [*.net *.split] |
| 06:36:50 | | akaibu|m quits [*.net *.split] |
| 06:36:50 | | CrispyAlice2 quits [*.net *.split] |
| 06:36:50 | | superusercode quits [*.net *.split] |
| 06:36:50 | | GRBaset quits [*.net *.split] |
| 06:36:50 | | octylFractal|m quits [*.net *.split] |
| 06:36:50 | | s-crypt|m|m quits [*.net *.split] |
| 06:36:50 | | wrangle|m quits [*.net *.split] |
| 06:36:50 | | noobirc|m quits [*.net *.split] |
| 06:36:50 | | jwoglom|m quits [*.net *.split] |
| 06:36:50 | | cmostracker|m quits [*.net *.split] |
| 06:36:50 | | jackt1365|m quits [*.net *.split] |
| 06:36:50 | | Cydog|m quits [*.net *.split] |
| 06:36:50 | | pannekoek11|m quits [*.net *.split] |
| 06:36:50 | | lasdkfj|m quits [*.net *.split] |
| 06:36:50 | | yzqzss quits [*.net *.split] |
| 06:36:50 | | Ruk8 quits [*.net *.split] |
| 06:36:50 | | Video quits [*.net *.split] |
| 06:36:50 | | upperbody321|m quits [*.net *.split] |
| 06:36:50 | | Roki_100|m quits [*.net *.split] |
| 06:36:50 | | madpro|m quits [*.net *.split] |
| 06:36:50 | | thermospheric quits [*.net *.split] |
| 06:36:50 | | hexagonwin|m quits [*.net *.split] |
| 06:36:50 | | victor_vaughn|m quits [*.net *.split] |
| 06:36:50 | | iCesenberk|m quits [*.net *.split] |
| 06:36:50 | | phaeton quits [*.net *.split] |
| 06:36:50 | | coro quits [*.net *.split] |
| 06:36:50 | | e2mau|m quits [*.net *.split] |
| 06:36:50 | | tech234a quits [*.net *.split] |
| 06:36:50 | | moe-a-m|m quits [*.net *.split] |
| 06:36:50 | | Thibaultmol quits [*.net *.split] |
| 06:36:50 | | tech234a|m-backup quits [*.net *.split] |
| 06:36:50 | | andrewvieyra|m quits [*.net *.split] |
| 06:36:50 | | finalti|m quits [*.net *.split] |
| 06:36:50 | | Alienmaster|m quits [*.net *.split] |
| 06:36:50 | | MinePlayersPEMyNey|m quits [*.net *.split] |
| 06:36:50 | | Fletcher quits [*.net *.split] |
| 06:36:50 | | Exorcism|m quits [*.net *.split] |
| 06:36:50 | | aaq|m quits [*.net *.split] |
| 06:36:50 | | nyuuzyou quits [*.net *.split] |
| 06:36:50 | | masterx244|m quits [*.net *.split] |
| 06:36:50 | | xxia|m quits [*.net *.split] |
| 06:36:50 | | @rewby|m quits [*.net *.split] |
| 06:36:50 | | mpeter|m quits [*.net *.split] |
| 06:36:50 | | schwarzkatz|m quits [*.net *.split] |
| 06:36:50 | | tomodachi94 quits [*.net *.split] |
| 06:36:50 | | MaxG quits [*.net *.split] |
| 06:36:50 | | flashfire42|m quits [*.net *.split] |
| 06:36:50 | | nstrom|m quits [*.net *.split] |
| 06:36:50 | | Tom|m1 quits [*.net *.split] |
| 06:36:50 | | Minkafighter|m quits [*.net *.split] |
| 06:36:50 | | alexshpilkin quits [*.net *.split] |
| 06:36:50 | | Vokun quits [*.net *.split] |
| 06:36:50 | | justauser|m quits [*.net *.split] |
| 06:36:50 | | theblazehen|m quits [*.net *.split] |
| 06:36:50 | | Hans5958 quits [*.net *.split] |
| 06:36:50 | | igneousx quits [*.net *.split] |
| 06:36:50 | | DigitalDragon quits [*.net *.split] |
| 06:36:50 | | cruller quits [*.net *.split] |
| 06:36:50 | | gamer191-1|m quits [*.net *.split] |
| 06:36:50 | | x9fff00 quits [*.net *.split] |
| 06:36:50 | | audrooku|m quits [*.net *.split] |
| 06:36:50 | | @Sanqui|m quits [*.net *.split] |
| 06:36:50 | | mind_combatant quits [*.net *.split] |
| 06:36:50 | | MAI|m quits [*.net *.split] |
| 06:36:50 | | britmob|m quits [*.net *.split] |
| 06:36:50 | | anon00001|m quits [*.net *.split] |
| 06:36:50 | | Ajay quits [*.net *.split] |
| 06:40:55 | | rewby|m (rewby) joins |
| 06:40:55 | | @ChanServ sets mode: +o rewby|m |
| 06:43:31 | | MPThLee quits [Quit: bye] |
| 06:44:06 | | MPThLee (MPThLee) joins |
| 06:44:40 | | mpeter|m joins |
| 06:44:40 | | haha-whered-it-go|m joins |
| 06:44:40 | | madpro|m joins |
| 06:44:40 | | NickS|m joins |
| 06:44:40 | | nosamu|m joins |
| 06:44:40 | | coro joins |
| 06:44:40 | | bogsen (bogsen) joins |
| 06:44:40 | | supermariofan67|m joins |
| 06:44:40 | | pannekoek11|m joins |
| 06:44:40 | | andrewvieyra|m joins |
| 06:44:40 | | ragu|m joins |
| 06:44:40 | | jevinskie joins |
| 06:44:40 | | superusercode joins |
| 06:44:40 | | trumad|m joins |
| 06:44:40 | | cmostracker|m joins |
| 06:44:40 | | alexshpilkin joins |
| 06:44:40 | | that_lurker|m joins |
| 06:44:40 | | Adamvoltagex|m joins |
| 06:44:40 | | upperbody321|m joins |
| 06:44:40 | | spearcat|m joins |
| 06:44:40 | | nightpool (nightpool) joins |
| 06:44:40 | | hexagonwin|m joins |
| 06:44:40 | | katia|m joins |
| 06:44:40 | | victor_vaughn|m joins |
| 06:44:40 | | its_notjack (its_notjack) joins |
| 06:44:40 | | MinePlayersPEMyNey|m joins |
| 06:44:40 | | Ruk8 (Ruk8) joins |
| 06:44:40 | | flashfire42|m (flashfire42) joins |
| 06:44:40 | | lasdkfj|m joins |
| 06:44:40 | | schwarzkatz|m joins |
| 06:44:40 | | britmob|m joins |
| 06:44:40 | | Sanqui|m (Sanqui) joins |
| 06:44:40 | | joepie91|m joins |
| 06:44:40 | | Fletcher (Fletcher) joins |
| 06:44:40 | | yzqzss (yzqzss) joins |
| 06:44:40 | | @ChanServ sets mode: +o Sanqui|m |
| 06:44:40 | | x9fff00 (x9fff00) joins |
| 06:44:40 | | moe-a-m|m joins |
| 06:44:40 | | DigitalDragon joins |
| 06:44:40 | | cruller joins |
| 06:44:40 | | Vokun (Vokun) joins |
| 06:44:40 | | tech234a (tech234a) joins |
| 06:44:40 | | l0rd_enki|m joins |
| 06:44:40 | | nyuuzyou joins |
| 06:44:40 | | th3z0l4|m joins |
| 06:44:40 | | EvanBoehs|m joins |
| 06:44:40 | | Nulo|m joins |
| 06:44:40 | | aaq|m joins |
| 06:44:40 | | GhostIsBeHere|m joins |
| 06:44:40 | | Hans5958 joins |
| 06:44:40 | | nstrom|m joins |
| 06:44:40 | | finalti|m joins |
| 06:44:40 | | masterx244|m (masterx244|m) joins |
| 06:44:40 | | Ajay joins |
| 06:44:40 | | xxia|m joins |
| 06:44:40 | | thermospheric joins |
| 06:44:40 | | CrispyAlice2 joins |
| 06:44:40 | | igneousx (igneousx) joins |
| 06:44:40 | | tech234a|m-backup (tech234a) joins |
| 06:44:41 | | Minkafighter|m joins |
| 06:44:41 | | noobirc|m joins |
| 06:44:41 | | Cydog|m joins |
| 06:44:41 | | iCesenberk|m joins |
| 06:44:41 | | GRBaset (GRBaset) joins |
| 06:44:41 | | Alienmaster|m joins |
| 06:44:41 | | ax|m joins |
| 06:44:41 | | e2mau|m joins |
| 06:44:41 | | jwoglom|m joins |
| 06:44:41 | | MaxG joins |
| 06:44:41 | | octylFractal|m joins |
| 06:44:41 | | Roki_100|m joins |
| 06:44:41 | | nano412510 (nano412510) joins |
| 06:44:41 | | audrooku|m joins |
| 06:44:41 | | theblazehen|m joins |
| 06:44:41 | | v1cs joins |
| 06:44:41 | | jackt1365|m joins |
| 06:44:41 | | anon00001|m joins |
| 06:44:41 | | wrangle|m joins |
| 06:44:41 | | Tom|m1 joins |
| 06:44:41 | | phaeton (phaeton) joins |
| 06:44:41 | | vics joins |
| 06:44:41 | | s-crypt|m|m joins |
| 06:44:41 | | akaibu|m joins |
| 06:44:41 | | osiride|m joins |
| 06:44:41 | | gamer191-1|m joins |
| 06:44:41 | | justauser|m (justauser|m) joins |
| 06:44:41 | | Video joins |
| 06:44:41 | | Fijxu|m joins |
| 06:44:41 | | starg2|m joins |
| 06:44:41 | | MAI|m joins |
| 06:44:41 | | Thibaultmol joins |
| 06:44:41 | | mikolaj|m joins |
| 06:44:41 | | Exorcism|m (exorcism) joins |
| 06:44:41 | | tomodachi94 (tomodachi94) joins |
| 06:44:41 | | mind_combatant (mind_combatant) joins |
| 06:44:59 | | triplecamera|m joins |
| 06:45:00 | | mister_x joins |
| 06:45:00 | | PhoHale|m joins |
| 06:45:00 | | wwww|m joins |
| 06:45:00 | | IceCodeNew|m joins |
| 06:45:00 | | hillow596|m joins |
| 06:45:00 | | ram|m joins |
| 06:45:00 | | Passiing|m joins |
| 06:45:00 | | Cronfox|m joins |
| 06:45:00 | | kaz__|m joins |
| 06:45:00 | | saouroun|m joins |
| 06:45:00 | | Misty|m joins |
| 06:45:00 | | rain|m joins |
| 06:45:00 | | ampdot|m joins |
| 06:45:00 | | Joy|m joins |
| 06:45:00 | | Claire|m joins |
| 06:45:00 | | yetanotherarchiver|m joins |
| 06:45:00 | | yarnover|m joins |
| 06:45:00 | | miksters|m joins |
| 06:45:00 | | noxious joins |
| 06:45:00 | | username675f|m joins |
| 06:45:00 | | will|m joins |
| 06:45:00 | | Valkum|m joins |
| 06:45:01 | | mat|m1 joins |
| 06:45:01 | | Tyrasuki|m joins |
| 06:45:01 | | gareth48|m joins |
| 06:46:17 | <helpimbashful> | Semi-random Internet Archive question: does anyone know where/how the Internet Archive obtains its dark archives? I can't find an example right now, but for example I've seen archive.org pages for music albums that just have a 30-second preview of each track and aren't downloadable |
| 06:49:40 | | M--mlv|m joins |
| 06:55:51 | <pokechu22> | #internetarchive is probably a better channel or that. My impression was that those are CDs that IA has in their physical collection and ripped themself, but I don't have a source |
| 07:03:03 | <helpimbashful> | thanks! |
| 07:57:35 | | bilboed08 quits [Read error: Connection reset by peer] |
| 07:57:41 | | bilboed08 joins |
| 08:29:27 | | Arachnophine quits [Quit: Ping timeout (120 seconds)] |
| 08:29:34 | | Arachnophine (Arachnophine) joins |
| 09:34:02 | | retrograde quits [Ping timeout: 240 seconds] |
| 09:34:25 | | retrograde (retrograde) joins |
| 11:00:04 | | Bleo1826007227196234552220110 quits [Quit: The Lounge - https://thelounge.chat] |
| 11:02:51 | | Bleo1826007227196234552220110 joins |
| 11:09:52 | | SootBector quits [Remote host closed the connection] |
| 11:11:02 | | SootBector (SootBector) joins |
| 11:24:38 | | polypeptide quits [Remote host closed the connection] |
| 11:28:13 | | polypeptide (polypeptide) joins |
| 11:28:46 | | SootBector quits [Remote host closed the connection] |
| 11:29:06 | | SootBector (SootBector) joins |
| 11:31:02 | | Webuser572193 quits [Quit: Ooops, wrong browser tab.] |
| 11:38:25 | | nine quits [Ping timeout: 268 seconds] |
| 11:38:59 | | nine joins |
| 11:47:27 | | Paw-chivist joins |
| 11:47:31 | <Paw-chivist> | Hi everyone ! :) |
| 11:49:36 | | nine quits [Ping timeout: 268 seconds] |
| 11:52:50 | | nine joins |
| 12:05:08 | | retrograde quits [Remote host closed the connection] |
| 12:06:08 | | retrograde (retrograde) joins |
| 12:11:06 | | retrograde quits [Remote host closed the connection] |
| 12:11:46 | | retrograde (retrograde) joins |
| 12:35:48 | | Paw-chivist quits [Client Quit] |
| 12:42:41 | <h2ibot> | Cruller edited List of website hosts (+52, /* X */ Added XS4ALL homepages): https://wiki.archiveteam.org/?diff=61113&oldid=58960 |
| 12:49:44 | | VerifiedJ7 quits [Remote host closed the connection] |
| 12:50:24 | | VerifiedJ7 (VerifiedJ) joins |
| 12:54:42 | <h2ibot> | Cruller edited List of website hosts (+64, Imported [[Web_Roasting#Standalone_services]]): https://wiki.archiveteam.org/?diff=61114&oldid=61113 |
| 13:00:43 | <h2ibot> | Cruller edited Web Roasting (-26, Added link to [[List of website hosts]] and…): https://wiki.archiveteam.org/?diff=61115&oldid=59787 |
| 13:06:43 | <h2ibot> | Cruller edited Web Roasting (+28, /* Lists of hosts */ Added [[List of website…): https://wiki.archiveteam.org/?diff=61116&oldid=61115 |
| 13:19:45 | | Webuser602420 joins |
| 13:24:35 | | Webuser200066 joins |
| 13:25:07 | | Webuser200066 quits [Client Quit] |
| 13:52:55 | <h2ibot> | Justauser edited List of major MediaWiki wikis with the LinkSearch extension (+72, WTF_delete.png - put Wikinews back): https://wiki.archiveteam.org/?diff=61117&oldid=61112 |
| 13:59:51 | <klea> | Oh, I should finish the thing to do the job of mwlinkscrape in bash. |
| 14:24:23 | | Arcorann quits [Ping timeout: 268 seconds] |
| 14:31:45 | | Nekroschizofrenetyk joins |
| 14:35:08 | | Nekroschizofrenetyk quits [Client Quit] |
| 14:35:50 | | Nekroschizofrenetyk joins |
| 14:36:41 | | eythian quits [Quit: http://quassel-irc.org - Chat comfortabel. Waar dan ook.] |
| 14:37:56 | | eythian joins |
| 14:44:01 | <cruller> | Speaking of which, why not periodically save all the pages linked from ATwiki so that you can detect any signs of them dying? |
| 14:45:44 | | lflare quits [Quit: Bye] |
| 14:46:30 | | lflare (lflare) joins |
| 14:49:11 | <pabs> | there is a DPoS project for all wikis coming that will do the former, but I think not the latter |
| 14:49:37 | <cruller> | Actually, what I meant wasn’t "all", but rather the blacklist approach ("when in doubt, save it"). |
| 14:50:32 | <pabs> | (IIRC the wiki thing queues outlinks to #// - the URLs project) |
| 14:52:03 | | szczot3k quits [Ping timeout: 268 seconds] |
| 15:01:18 | | ducky quits [Ping timeout: 268 seconds] |
| 15:01:29 | | ducky (ducky) joins |
| 15:03:06 | <cruller> | pabs: I'm not sure how detailed analysis should be to detect them, but I think even just checking the status codes would be somewhat useful. |
| 15:06:40 | <cruller> | Well, detailed analysis probably wouldn’t be worth the effort. Even with the most thorough investigation, it is impossible to predict every death. |
| 15:11:10 | | jinn6 quits [Ping timeout: 268 seconds] |
| 15:13:58 | <cruller> | Anyway, thanks for telling me about the DPoS project! |
| 15:14:35 | <klea> | https://wiki.archiveteam.org/index.php/URLs#:~:text=Also%2C%20if%20you%20run%20at%20significant%20speed%2C%20you%27ll%20likely%20see%20abuse%20notices%2C%20IP%20blacklists%2C%20and%20so%20on%2E |
| 15:17:04 | | jinn6 (jinn6) joins |
| 15:24:46 | <justauser> | Yeah, people brave enough to run #// are in high demand. |
| 15:37:35 | | Nekroschizofrenetyk quits [Client Quit] |
| 15:42:54 | | Nekroschizofrenetyk joins |
| 15:58:44 | | ducky quits [Ping timeout: 268 seconds] |
| 16:11:59 | | Nekroschizofrenetyk quits [Client Quit] |
| 16:14:03 | | ducky (ducky) joins |
| 16:20:42 | | ducky_ (ducky) joins |
| 16:21:33 | | ducky quits [Ping timeout: 268 seconds] |
| 16:21:34 | | ducky_ is now known as ducky |
| 16:24:07 | <klea> | I wonder how well bash would fare against doing something like declare -a found_urls=() and then somehow checking if a URL is in the array and handling 25K urls in the array. |
| 16:24:27 | <klea> | I suppose it might be better to dump url list into files or something to |
| 16:24:39 | <klea> | sort | uniq them, but that'd need all the input at first. |
| 16:25:19 | | szczot3k (szczot3k) joins |
| 16:30:43 | | szczot3k quits [Ping timeout: 268 seconds] |
| 16:38:25 | | kutuk9 joins |
| 16:38:44 | <kutuk9> | !help |
| 16:39:16 | <h2ibot> | Cooljeanius edited Deathwatch (-2, /* 2026-10 */ copyedit): https://wiki.archiveteam.org/?diff=61118&oldid=61111 |
| 16:42:12 | <kutuk9> | #down-the-tube |
| 16:44:17 | <h2ibot> | Cooljeanius edited Deathwatch (+46, /* 2026-05 */ copyedit): https://wiki.archiveteam.org/?diff=61119&oldid=61118 |
| 16:48:17 | <h2ibot> | Justauser edited Tripod (+24, DPoS archiving in progress): https://wiki.archiveteam.org/?diff=61120&oldid=61073 |
| 16:54:00 | | kutuk9 quits [Client Quit] |
| 16:55:18 | <h2ibot> | Justauser edited Angelfire (+8, Offline for good?): https://wiki.archiveteam.org/?diff=61121&oldid=61036 |
| 17:07:46 | <klea> | https://transfer.archivete.am/hcAKp/mwlinkscraper.bash |
| 17:07:47 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/hcAKp/mwlinkscraper.bash |
| 17:10:28 | <justauser> | Seems to use different format for the wiki list? |
| 17:12:28 | <klea> | It uses the same page, it just doesn't try to parse it as html :p |
| 17:12:58 | <klea> | https://wiki.archiveteam.org/index.php/List_of_major_MediaWiki_wikis_with_the_LinkSearch_extension?action=raw will give you the wiki page, so I just grep for lines begining with supported protocols by curl. |
| 17:13:14 | <justauser> | I can see it does |
| 17:13:16 | <justauser> | ${mediawiki_url}/Special:LinkSearch?limit=${OFFSET_INC}&offset=${offset}&target=$target |
| 17:13:30 | <klea> | Oh that, yes. I believe all wikis do it the same way. |
| 17:13:42 | <justauser> | Which implies en.wikipedia.org/wiki -style list. |
| 17:13:44 | <klea> | At least wikipedia and AT's wiki behaves it. |
| 17:14:14 | <justauser> | But our current list is https://en.wikipedia.org/w/index.php -style. |
| 17:14:22 | <klea> | https://wiki.archiveteam.org/index.php/Special:MyPage |
| 17:14:53 | <klea> | It seems to work when appended to the index.php. |
| 17:15:14 | <klea> | I can put it to do ?title=... if desired. |
| 17:15:27 | <justauser> | If you say it works... ¯\_(ツ)_/¯ |
| 17:15:35 | <klea> | https://en.wikipedia.org/w/index.php/Special:LinkSearch |
| 17:15:49 | <klea> | I should have added a -L into that one. |
| 17:16:09 | <klea> | Because fr.wikipedia.org for example redirects you. |
| 17:16:49 | <klea> | https://transfer.archivete.am/qVsma/mwlinkscraper.bash.zst |
| 17:17:45 | | kansei- quits [Quit: ZNC 1.10.1 - https://znc.in] |
| 17:26:27 | | szczot3k (szczot3k) joins |
| 17:26:42 | <klea> | Uhh, I'm not entirely sure it handles finishing a site properly. |
| 17:31:09 | | szczot3k quits [Ping timeout: 268 seconds] |
| 17:31:09 | | chunkynutz6018 quits [Read error: Connection reset by peer] |
| 17:31:39 | | chunkynutz6018 joins |
| 17:35:23 | | Paw-chivist joins |
| 17:35:25 | <Paw-chivist> | Hi everyone ! :) |
| 17:38:34 | <justauser> | Hello. |
| 17:38:48 | | flotwig quits [Quit: ZNC - http://znc.in] |
| 17:39:53 | | kansei (kansei) joins |
| 17:40:29 | | retrograde quits [Remote host closed the connection] |
| 17:40:44 | | flotwig joins |
| 17:40:54 | | retrograde (retrograde) joins |
| 17:43:13 | <Paw-chivist> | Today, while reading the wiki, I noticed that a lot of link list pages are missing. For example Libaneese medias or Algerian political parties. Does someone know if it's useful to create those pages ? And, by the way, do you have to do something to update pages like [[ArchiveBot/French political parties/list]] with HadeanEon ? |
| 17:45:52 | <justauser> | Seems to be somewhat abandoned? |
| 17:46:13 | <Paw-chivist> | The bot, yep' it seems. :/ |
| 17:46:29 | <justauser> | pokechu22 does most of the political party stuff in #vooterbooter. |
| 17:46:40 | <Paw-chivist> | Someone asked the source code to do another bot, but I don't know if there's any news about that. |
| 17:47:37 | <justauser> | You mean mwlinkscrape? |
| 17:47:39 | <Paw-chivist> | I would be very happy to help pokechu22, the pages need a little update :3 |
| 17:47:40 | <pokechu22> | I've mostly been focusing on US stuff, and generally I don't use those pages on the wiki. I'm not sure if c3manu does or not. My workflow is generally using Wikipedia and looking up parties that way - other lists often get outdated too fast |
| 17:47:48 | <Paw-chivist> | No I mean HadeanEon |
| 17:48:21 | | lennier2 joins |
| 17:48:50 | <Paw-chivist> | Oh so you don't update the wiki pages pokechu22 ? :( |
| 17:48:56 | <Paw-chivist> | Thanks for your response brw :3 |
| 17:48:58 | <Paw-chivist> | btw* |
| 17:49:23 | <pokechu22> | I generally don't update those lists. I do update some wiki pages, but I don't use those ArchiveBot list subpages |
| 17:50:57 | <Paw-chivist> | Oh, okay |
| 17:51:04 | <Paw-chivist> | And do you use https://wiki.archiveteam.org/index.php?title=Elections ? |
| 17:51:30 | | lennier2_ quits [Ping timeout: 268 seconds] |
| 17:52:07 | <Paw-chivist> | I will add 2026 French municipal elections |
| 17:52:29 | <pokechu22> | No, I use https://ballotpedia.org/Elections_calendar#Upcoming_election_dates and I think c3manu uses https://en.wikipedia.org/wiki/2026_national_electoral_calendar#March |
| 17:54:25 | <Paw-chivist> | Oh :/ |
| 17:55:05 | <Paw-chivist> | And for political parties ? Like outside the elections periods ? |
| 17:58:53 | <c3manu> | Paw-chivist, pokechu22: i occasionally attempt to fill out some wiki pages but it quickly gets too much work, and almost nobody else works on stuff like that. for now i'm okay with adding political party and govt stuff to the urls-sources when i encounter them archiving #vooterbooter stuff |
| 17:59:52 | <c3manu> | i think my most enthusiastic attempt so far has been https://wiki.archiveteam.org/index.php/Abkhazia :/ |
| 18:01:27 | <h2ibot> | Manu edited Abkhazia (+1, Fix typo): https://wiki.archiveteam.org/?diff=61122&oldid=60059 |
| 18:06:04 | <Paw-chivist> | I can try to expand a lot https://wiki.archiveteam.org/index.php/France if you're ok with that ? |
| 18:06:25 | <klea> | If someone wants to write updated bot code I can have it run with all the current ones KleaBot runs. |
| 18:14:11 | <c3manu> | Paw-chivist: i don't think anyone would oppose that :) |
| 18:15:41 | <c3manu> | klea: oh, does it also meddle with those per country or election lists as well? how? |
| 18:16:14 | <klea> | No, KleaBot doesn't have anything to touch election lists (yet). |
| 18:16:27 | <klea> | If people give me an idea or sketch of how to do it, or the code, I can try to implement it. |
| 18:16:29 | <klea> | If wanted. |
| 18:19:56 | <c3manu> | I don't think it would work well. Most of the time there is manual research required. I could probably spend a few weeks' evenings just entering the political party and politician websites i found for #vooterbooter stuff into the wikipedia or wikidata.. |
| 18:24:07 | <klea> | Then not doing it. |
| 18:24:24 | <klea> | Re doing the code that the other bot did to check if websites are archived on AB would be neat tho. |
| 18:24:34 | <klea> | Even if the /list pages would be human filled. |
| 18:31:45 | | steering quits [Quit: [TLS] Client upgrade] |
| 18:35:00 | <Paw-chivist> | So to begin I did a little list of gouv.fr sites on https://wiki.archiveteam.org/index.php/France |
| 18:35:15 | <Paw-chivist> | To you, can it be useful ? |
| 18:45:07 | | steering (steering) joins |
| 18:48:52 | | szczot3k (szczot3k) joins |
| 18:51:03 | | steering quits [Client Quit] |
| 18:51:12 | | steering (steering) joins |
| 19:07:57 | | unknownsrc2 (unknownsrc) joins |
| 19:07:58 | | unknownsrc quits [Ping timeout: 268 seconds] |
| 19:07:58 | | unknownsrc2 is now known as unknownsrc |
| 19:08:25 | <Paw-chivist> | And I added a list of public librarys urls |
| 19:08:35 | <Paw-chivist> | libraries* |
| 19:12:38 | <h2ibot> | User edited Discord (-40, /* Dictionaries */): https://wiki.archiveteam.org/?diff=61123&oldid=60485 |
| 19:12:39 | <h2ibot> | User edited France (+169832): https://wiki.archiveteam.org/?diff=61124&oldid=58007 |
| 19:13:38 | <h2ibot> | John5433 edited Soyjak.party (+618, /* archive.soyjak.org */): https://wiki.archiveteam.org/?diff=61125&oldid=60870 |
| 19:13:39 | <h2ibot> | John5433 edited 4chan (+432, /* Lost Archives */): https://wiki.archiveteam.org/?diff=61126&oldid=60824 |
| 19:13:40 | <h2ibot> | John5433 created Category:Endangered (+0, Created blank page): https://wiki.archiveteam.org/?oldid=61127 |
| 19:13:41 | <h2ibot> | John5433 uploaded File:WorldAthleticProject.png: https://wiki.archiveteam.org/?title=File%3AWorldAthleticProject.png |
| 19:13:42 | <h2ibot> | John5433 uploaded File:Pensivenonsen.png: https://wiki.archiveteam.org/?title=File%3APensivenonsen.png |
| 19:13:43 | <h2ibot> | John5433 uploaded File:Foolzashit.png: https://wiki.archiveteam.org/?title=File%3AFoolzashit.png |
| 19:13:44 | <h2ibot> | John5433 uploaded File:Wakarimasen.png: https://wiki.archiveteam.org/?title=File%3AWakarimasen.png |
| 19:13:45 | <h2ibot> | JustAnotherArchivist changed the user rights of User:John5433 |
| 19:15:39 | <h2ibot> | Javascriptone edited Amino (+11, Grammatical fixes, as well as date format fixes.): https://wiki.archiveteam.org/?diff=61133&oldid=60246 |
| 19:19:10 | <c3manu> | Paw-chivist++ |
| 19:19:11 | <eggdrop> | [karma] 'Paw-chivist' now has 1 karma! |
| 19:21:27 | <Paw-chivist> | I'm cleaning the libraries list, sorry for the big edit |
| 19:21:51 | <steering> | arf arf |
| 19:22:13 | <@JAA> | Paw-chivist: Doesn't look like I can edit your username on the wiki currently, so that'll have to wait. |
| 19:22:15 | <steering> | oh this isn't -ot :) |
| 19:22:44 | <Paw-chivist> | No problem if the edit is in pending, I'm editing the list on my computer |
| 19:23:00 | <h2ibot> | JustAnotherArchivist changed the user rights of User:User |
| 19:23:17 | <Paw-chivist> | Oh |
| 19:24:44 | <Paw-chivist> | Thanks ! <3 |
| 19:40:06 | <Paw-chivist> | Done ! |
| 19:40:52 | <h2ibot> | User edited France (-33040, /* Public libraries */): https://wiki.archiveteam.org/?diff=61134&oldid=61124 |
| 19:41:01 | <klea> | Oh, Paw-chivist didn't rejoin the #webroasting channel :p (also join #archiveteam-ot I guess if you want) |
| 19:41:36 | <Paw-chivist> | My browser can't remember the channels :/ |
| 19:41:55 | <Paw-chivist> | It works for kiwirc but not for hackint :P |
| 19:41:55 | <@JAA> | If you intend to hang out here in the longer term, I'd recommend getting a real IRC client. |
| 19:42:22 | <Paw-chivist> | It's a good idea, thanks JAA ! <3 |
| 19:44:37 | <Paw-chivist> | So, what do you think about this massive list ? |
| 19:44:43 | <Paw-chivist> | What should we do next ? |
| 20:33:17 | | Wohlstand (Wohlstand) joins |
| 20:49:37 | <Paw-chivist> | Here's the cleanest list of links for public librairies : |
| 20:50:01 | <h2ibot> | User edited France (-24200, /* Public libraries */): https://wiki.archiveteam.org/?diff=61135&oldid=61134 |
| 20:53:04 | <Paw-chivist> | So is there a procedure of what to do with those links or we send them to AB ? |
| 20:54:36 | | klea[convos] joins |
| 20:57:02 | | retrograde quits [Ping timeout: 240 seconds] |
| 20:57:07 | | retrograde (retrograde) joins |
| 20:57:32 | <klea> | Probably AB, but I don't know how they deal with getting a ton of links at once. |
| 21:00:16 | <nicolas17> | that edit made the page 24KB *smaller*? |
| 21:00:25 | <Paw-chivist> | Yep |
| 21:00:44 | <Paw-chivist> | If I get the perms, I can send it one by one and add job IDs to the wiki page, but maybe someone know how to do it quickly and in a better way :3 |
| 21:00:59 | <klea> | queueh2i. |
| 21:02:54 | <klea> | What did you do with the wiki page, remove all Public libraries, order them, and dedupe them and put them at the end? |
| 21:04:03 | <h2ibot> | User edited France (-86, /* Public libraries */): https://wiki.archiveteam.org/?diff=61136&oldid=61135 |
| 21:04:53 | <Paw-chivist> | So what I did : I grabbed all the Public librairies websites from a dataset on data.gouv.fr, then I deduped them, checked for only librairies websites, then checked only for up websites (no 404 or other problems), then deduped again. |
| 21:06:23 | <klea> | Ah. |
| 21:18:35 | <Paw-chivist> | It's bad ? |
| 21:19:18 | <klea> | No? |
| 21:19:33 | <Paw-chivist> | Ah, ouf. |
| 21:20:59 | <@JAA> | Keeping a record of the ones that are dead would be good though. |
| 21:22:08 | <klea> | Yeah. |
| 21:31:02 | | steering7254 joins |
| 21:33:03 | | Webuser418391 joins |
| 21:33:12 | | Webuser418391 quits [Client Quit] |
| 21:35:47 | | steering7254 leaves |
| 21:36:12 | | etnguyen03 (etnguyen03) joins |
| 21:37:32 | | Paw-chivist quits [Quit: Ooops, wrong browser tab.] |
| 22:03:14 | | nexussfan (nexussfan) joins |
| 22:04:48 | | etnguyen03 quits [Client Quit] |
| 22:06:52 | <cruller> | JAA: What is the recommended way to do that on the wiki? Sometimes {{offline}} is added, sometimes <s> is used, and sometimes it is simply deleted or overwritten. |