00:00:34 | | jtagcat quits [Client Quit] |
00:02:20 | | jtagcat (jtagcat) joins |
00:19:18 | | Megame1_ (Megame) joins |
00:21:00 | | Megame quits [Read error: Connection reset by peer] |
00:22:36 | | Megame1_ is now known as Megame |
00:24:22 | <nicolas17> | wow |
00:24:25 | <nicolas17> | my system got all laggy |
00:24:30 | <nicolas17> | turns out it was swapping |
00:24:42 | <nicolas17> | because the archivebot.com tab was using 3GB of RAM and growing |
00:26:51 | <Pedrosso> | Haha. It tends to do that |
00:27:41 | <@JAA> | How long did you have it open? |
00:27:53 | <nicolas17> | like a minute or two, idk what's up with that |
00:28:04 | <@JAA> | I've had it open for hours, and it uses under 200 MB. |
00:28:05 | <nicolas17> | I expanded the log of a job |
00:28:13 | <nicolas17> | which may have affected it |
00:29:24 | <@JAA> | ~300 MB when expanding all logs, although it disappeared from about:performance for a bit. lol |
00:30:16 | <@JAA> | Firefox, by the way. |
00:32:12 | <nicolas17> | I should change my support.apple.com scraping script to not trash my SSD... |
00:32:30 | <@JAA> | /dev/shm <3 |
00:32:47 | <nicolas17> | instead of "rm data/*; download everything; if git diff --quiet; then commit; fi" |
00:33:03 | <nicolas17> | I should read the existing file and compare it with what was downloaded, if it's the same then don't write anything |
00:33:07 | | icedice quits [Client Quit] |
00:36:03 | <@JAA> | Ah |
00:56:35 | <nicolas17> | it's like 480MB, I wouldn't want to keep that in memory between runs |
01:10:26 | <Pedrosso> | Oh yeah not from like a minute or two, at least for me |
02:14:44 | | Arcorann quits [Remote host closed the connection] |
02:20:31 | | Arcorann (Arcorann) joins |
03:16:59 | | pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.] |
03:18:49 | | pabs (pabs) joins |
03:29:14 | | M--mlv|m quits [*.net *.split] |
03:29:15 | | marius851000 quits [*.net *.split] |
03:29:15 | | that_lurker|m quits [*.net *.split] |
03:29:15 | | hillow596|m quits [*.net *.split] |
03:29:15 | | Nulo|m quits [*.net *.split] |
03:29:15 | | sonst-was|m quits [*.net *.split] |
03:29:15 | | EvanBoehs|m quits [*.net *.split] |
03:29:15 | | alexshpilkin quits [*.net *.split] |
03:29:15 | | qq44|m quits [*.net *.split] |
03:29:15 | | Misty|m quits [*.net *.split] |
03:29:15 | | username675f|m quits [*.net *.split] |
03:29:15 | | AntoninDelFabbro|m quits [*.net *.split] |
03:29:15 | | Maakuth|m quits [*.net *.split] |
03:29:15 | | Peetz0r|m quits [*.net *.split] |
03:29:15 | | EmeraldSnorlax|m quits [*.net *.split] |
03:29:15 | | kaz__|m quits [*.net *.split] |
03:29:15 | | ram|m quits [*.net *.split] |
03:29:15 | | Passiing|m quits [*.net *.split] |
03:29:15 | | noxious quits [*.net *.split] |
03:29:16 | | yetanotherarchiver|m quits [*.net *.split] |
03:29:16 | | trumad|m quits [*.net *.split] |
03:29:16 | | NickS|m quits [*.net *.split] |
03:29:16 | | haha-whered-it-go|m quits [*.net *.split] |
03:29:16 | | joepie91|m quits [*.net *.split] |
03:29:16 | | gwetchen|m quits [*.net *.split] |
03:29:16 | | superusercode quits [*.net *.split] |
03:29:16 | | noobirc|m quits [*.net *.split] |
03:29:16 | | GRBaset quits [*.net *.split] |
03:29:16 | | lasdkfj|m quits [*.net *.split] |
03:29:16 | | nyuuzyou quits [*.net *.split] |
03:29:16 | | jevinskie quits [*.net *.split] |
03:29:16 | | will|m quits [*.net *.split] |
03:29:16 | | JC|m quits [*.net *.split] |
03:29:16 | | pannekoek11|m quits [*.net *.split] |
03:29:16 | | jwoglom|m quits [*.net *.split] |
03:29:16 | | gungagungagunga|m quits [*.net *.split] |
03:29:17 | | coro quits [*.net *.split] |
03:29:17 | | Cydog|m quits [*.net *.split] |
03:29:17 | | t3chler|m quits [*.net *.split] |
03:29:17 | | akaibu|m quits [*.net *.split] |
03:29:17 | | qyxojzh|m quits [*.net *.split] |
03:29:17 | | thermospheric quits [*.net *.split] |
03:29:17 | | hlgs|m quits [*.net *.split] |
03:29:17 | | vexr quits [*.net *.split] |
03:29:17 | | Video quits [*.net *.split] |
03:29:17 | | Ruk8 quits [*.net *.split] |
03:29:17 | | Max|m12 quits [*.net *.split] |
03:29:17 | | iCesenberk|m quits [*.net *.split] |
03:29:17 | | octylFractal|m quits [*.net *.split] |
03:29:17 | | wrangle|m quits [*.net *.split] |
03:29:17 | | nosamu|m quits [*.net *.split] |
03:29:17 | | masterx244|m quits [*.net *.split] |
03:29:17 | | voltagex|m quits [*.net *.split] |
03:29:17 | | mikolaj|m quits [*.net *.split] |
03:29:17 | | tech234a|m quits [*.net *.split] |
03:29:17 | | Roki_100|m quits [*.net *.split] |
03:29:17 | | finalti|m quits [*.net *.split] |
03:29:18 | | moe-a-m|m quits [*.net *.split] |
03:29:18 | | schwarzkatz|m quits [*.net *.split] |
03:29:18 | | jackt1365|m quits [*.net *.split] |
03:29:18 | | saouroun|m quits [*.net *.split] |
03:29:18 | | madpro|m quits [*.net *.split] |
03:29:18 | | Minkafighter|m quits [*.net *.split] |
03:29:18 | | yzqzss quits [*.net *.split] |
03:29:18 | | x9fff00 quits [*.net *.split] |
03:29:18 | | Exorcism quits [*.net *.split] |
03:29:18 | | phaeton quits [*.net *.split] |
03:29:18 | | Tom|m1 quits [*.net *.split] |
03:29:18 | | Froxcey|m quits [*.net *.split] |
03:29:18 | | manu|m quits [*.net *.split] |
03:29:18 | | s-crypt|m|m quits [*.net *.split] |
03:29:18 | | CrispyAlice2 quits [*.net *.split] |
03:29:18 | | Fletcher quits [*.net *.split] |
03:29:18 | | ragu|m quits [*.net *.split] |
03:29:18 | | flashfire42|m quits [*.net *.split] |
03:29:18 | | MinePlayersPEMyNey|m quits [*.net *.split] |
03:29:18 | | Thibaultmol quits [*.net *.split] |
03:29:18 | | nstrom|m quits [*.net *.split] |
03:29:18 | | Hans5958 quits [*.net *.split] |
03:29:18 | | theblazehen|m quits [*.net *.split] |
03:29:18 | | mpeter|m quits [*.net *.split] |
03:29:18 | | rewby|m quits [*.net *.split] |
03:29:18 | | Vokun quits [*.net *.split] |
03:29:18 | | tomodachi94 quits [*.net *.split] |
03:29:19 | | audrooku|m quits [*.net *.split] |
03:29:19 | | britmob|m quits [*.net *.split] |
03:29:19 | | xxia|m quits [*.net *.split] |
03:29:19 | | cmostracker|m quits [*.net *.split] |
03:29:19 | | DigitalDragon quits [*.net *.split] |
03:29:19 | | andrewvieyra|m quits [*.net *.split] |
03:29:19 | | mind_combatant quits [*.net *.split] |
03:29:19 | | @Sanqui|m quits [*.net *.split] |
03:29:19 | | igneousx quits [*.net *.split] |
03:29:19 | | Ajay quits [*.net *.split] |
03:36:24 | <Ryz> | Is there anything else to save from Evernote? Is there user content to get? Considering https://techcrunch.com/2023/11/29/its-official-evernote-will-restrict-free-users-to-50-notes/ |
03:38:59 | | M--mlv|m joins |
03:38:59 | | marius851000 joins |
03:38:59 | | that_lurker|m joins |
03:38:59 | | hillow596|m joins |
03:38:59 | | Nulo|m joins |
03:38:59 | | sonst-was|m joins |
03:38:59 | | EvanBoehs|m joins |
03:38:59 | | qq44|m joins |
03:38:59 | | alexshpilkin joins |
03:38:59 | | Misty|m joins |
03:39:00 | | username675f|m joins |
03:39:00 | | AntoninDelFabbro|m joins |
03:39:00 | | Maakuth|m joins |
03:39:00 | | Peetz0r|m joins |
03:39:00 | | EmeraldSnorlax|m joins |
03:39:00 | | kaz__|m joins |
03:39:00 | | ram|m joins |
03:39:00 | | Passiing|m joins |
03:39:00 | | noxious joins |
03:39:00 | | Exorcism (exorcism) joins |
03:39:00 | | JC|m joins |
03:39:00 | | s-crypt|m|m joins |
03:39:00 | | iCesenberk|m joins |
03:39:00 | | qyxojzh|m joins |
03:39:00 | | octylFractal|m joins |
03:39:00 | | flashfire42|m joins |
03:39:00 | | coro joins |
03:39:00 | | phaeton (phaeton) joins |
03:39:00 | | Vokun (Vokun) joins |
03:39:00 | | Video joins |
03:39:00 | | yetanotherarchiver|m joins |
03:39:00 | | noobirc|m joins |
03:39:00 | | cmostracker|m joins |
03:39:00 | | gungagungagunga|m joins |
03:39:00 | | Tom|m1 joins |
03:39:00 | | Cydog|m joins |
03:39:00 | | trumad|m joins |
03:39:00 | | Roki_100|m joins |
03:39:00 | | nosamu|m joins |
03:39:00 | | jwoglom|m joins |
03:39:00 | | Max|m12 joins |
03:39:00 | | manu|m joins |
03:39:00 | | superusercode (superusercode) joins |
03:39:00 | | nyuuzyou (nyuuzyou) joins |
03:39:00 | | vexr joins |
03:39:00 | | wrangle|m joins |
03:39:00 | | moe-a-m|m joins |
03:39:00 | | Hans5958 (Hans5958) joins |
03:39:00 | | will|m joins |
03:39:00 | | jevinskie (jevinskie) joins |
03:39:00 | | gwetchen|m joins |
03:39:00 | | yzqzss (yzqzss) joins |
03:39:00 | | CrispyAlice2 (CrispyAlice2) joins |
03:39:00 | | voltagex|m joins |
03:39:00 | | tomodachi94 (tomodachi94) joins |
03:39:00 | | Fletcher (Fletcher) joins |
03:39:00 | | Thibaultmol joins |
03:39:00 | | finalti|m joins |
03:39:00 | | lasdkfj|m joins |
03:39:00 | | Minkafighter|m joins |
03:39:00 | | GRBaset (GRBaset) joins |
03:39:00 | | Froxcey|m joins |
03:39:00 | | Sanqui|m (Sanqui) joins |
03:39:00 | | masterx244|m joins |
03:39:00 | | akaibu|m joins |
03:39:00 | | DigitalDragon (DigitalDragon) joins |
03:39:00 | | NickS|m joins |
03:39:00 | | igneousx (igneousx) joins |
03:39:00 | | Ajay joins |
03:39:00 | | audrooku|m joins |
03:39:00 | | schwarzkatz|m joins |
03:39:00 | | ragu|m joins |
03:39:00 | | britmob|m joins |
03:39:00 | | madpro|m joins |
03:39:00 | | x9fff00 (x9fff00) joins |
03:39:00 | | jackt1365|m joins |
03:39:00 | | t3chler|m joins |
03:39:00 | | haha-whered-it-go|m joins |
03:39:00 | | saouroun|m joins |
03:39:00 | | mind_combatant joins |
03:39:00 | | nstrom|m joins |
03:39:00 | | theblazehen|m joins |
03:39:00 | | andrewvieyra|m joins |
03:39:00 | | hlgs|m joins |
03:39:00 | | xxia|m joins |
03:39:00 | | mpeter|m joins |
03:39:00 | | thermospheric (Thermospheric) joins |
03:39:00 | | mikolaj|m joins |
03:39:00 | | Ruk8 (Ruk8) joins |
03:39:00 | | MinePlayersPEMyNey|m joins |
03:39:00 | | tech234a|m joins |
03:39:00 | | ing.hackint.org sets mode: +o Sanqui|m |
03:39:00 | | pannekoek11|m joins |
03:39:00 | | joepie91|m joins |
03:39:00 | | rewby|m joins |
03:39:05 | | coderobe quits [Max SendQ exceeded] |
03:39:26 | | coderobe (coderobe) joins |
03:40:05 | <nicolas17> | Ryz: I think it's all private content sooo |
03:41:25 | <lindowsME> | Hey were the videos from funnyordie.com saved? |
03:41:25 | <lindowsME> | they're not on the site anymore (since years ago), but seem to all still be on S3. the old cdn used to redirect, now 404s. |
03:41:35 | <lindowsME> | https://web.archive.org/cdx/search/cdx?url=vo.fod4.com/v/*&limit=1000 |
03:41:41 | <lindowsME> | https://web.archive.org/cdx/search/cdx?url=http://s3.amazonaws.com/production.videos.funnyordie.com/v/*&limit=1000 |
03:42:11 | <lindowsME> | archive.org 403s |
03:54:43 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
03:54:51 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
03:56:18 | | qwertyasdfuiopghjkl quits [Excess Flood] |
03:58:12 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
04:05:35 | | kitonthe2et quits [Ping timeout: 272 seconds] |
04:05:55 | | kitonthe1et joins |
04:11:17 | | kitonthe1et quits [Ping timeout: 272 seconds] |
04:12:56 | | M--mlv|m quits [*.net *.split] |
04:12:58 | | marius851000 quits [*.net *.split] |
04:12:58 | | that_lurker|m quits [*.net *.split] |
04:12:58 | | hillow596|m quits [*.net *.split] |
04:12:58 | | Nulo|m quits [*.net *.split] |
04:12:58 | | sonst-was|m quits [*.net *.split] |
04:12:58 | | EvanBoehs|m quits [*.net *.split] |
04:12:58 | | qq44|m quits [*.net *.split] |
04:12:58 | | alexshpilkin quits [*.net *.split] |
04:12:58 | | Misty|m quits [*.net *.split] |
04:12:58 | | username675f|m quits [*.net *.split] |
04:12:58 | | AntoninDelFabbro|m quits [*.net *.split] |
04:12:58 | | Maakuth|m quits [*.net *.split] |
04:12:58 | | Peetz0r|m quits [*.net *.split] |
04:12:58 | | EmeraldSnorlax|m quits [*.net *.split] |
04:12:58 | | kaz__|m quits [*.net *.split] |
04:12:58 | | ram|m quits [*.net *.split] |
04:12:58 | | Passiing|m quits [*.net *.split] |
04:12:58 | | noxious quits [*.net *.split] |
04:12:58 | | yetanotherarchiver|m quits [*.net *.split] |
04:12:58 | | trumad|m quits [*.net *.split] |
04:12:58 | | NickS|m quits [*.net *.split] |
04:12:58 | | haha-whered-it-go|m quits [*.net *.split] |
04:12:58 | | joepie91|m quits [*.net *.split] |
04:12:58 | | gwetchen|m quits [*.net *.split] |
04:12:58 | | superusercode quits [*.net *.split] |
04:12:58 | | noobirc|m quits [*.net *.split] |
04:12:58 | | lasdkfj|m quits [*.net *.split] |
04:12:58 | | GRBaset quits [*.net *.split] |
04:12:58 | | nyuuzyou quits [*.net *.split] |
04:12:58 | | jevinskie quits [*.net *.split] |
04:12:59 | | will|m quits [*.net *.split] |
04:12:59 | | JC|m quits [*.net *.split] |
04:12:59 | | pannekoek11|m quits [*.net *.split] |
04:12:59 | | jwoglom|m quits [*.net *.split] |
04:12:59 | | gungagungagunga|m quits [*.net *.split] |
04:12:59 | | coro quits [*.net *.split] |
04:12:59 | | Cydog|m quits [*.net *.split] |
04:13:00 | | t3chler|m quits [*.net *.split] |
04:13:00 | | akaibu|m quits [*.net *.split] |
04:13:00 | | qyxojzh|m quits [*.net *.split] |
04:13:00 | | hlgs|m quits [*.net *.split] |
04:13:00 | | thermospheric quits [*.net *.split] |
04:13:00 | | vexr quits [*.net *.split] |
04:13:00 | | Video quits [*.net *.split] |
04:13:00 | | Ruk8 quits [*.net *.split] |
04:13:00 | | iCesenberk|m quits [*.net *.split] |
04:13:00 | | octylFractal|m quits [*.net *.split] |
04:13:00 | | wrangle|m quits [*.net *.split] |
04:13:00 | | nosamu|m quits [*.net *.split] |
04:13:00 | | Max|m12 quits [*.net *.split] |
04:13:00 | | masterx244|m quits [*.net *.split] |
04:13:00 | | voltagex|m quits [*.net *.split] |
04:13:00 | | mikolaj|m quits [*.net *.split] |
04:13:00 | | tech234a|m quits [*.net *.split] |
04:13:00 | | Roki_100|m quits [*.net *.split] |
04:13:00 | | finalti|m quits [*.net *.split] |
04:13:01 | | moe-a-m|m quits [*.net *.split] |
04:13:01 | | schwarzkatz|m quits [*.net *.split] |
04:13:01 | | jackt1365|m quits [*.net *.split] |
04:13:01 | | saouroun|m quits [*.net *.split] |
04:13:01 | | madpro|m quits [*.net *.split] |
04:13:01 | | Minkafighter|m quits [*.net *.split] |
04:13:01 | | yzqzss quits [*.net *.split] |
04:13:01 | | x9fff00 quits [*.net *.split] |
04:13:01 | | Exorcism quits [*.net *.split] |
04:13:01 | | phaeton quits [*.net *.split] |
04:13:01 | | Tom|m1 quits [*.net *.split] |
04:13:01 | | Froxcey|m quits [*.net *.split] |
04:13:01 | | manu|m quits [*.net *.split] |
04:13:01 | | s-crypt|m|m quits [*.net *.split] |
04:13:01 | | CrispyAlice2 quits [*.net *.split] |
04:13:02 | | Fletcher quits [*.net *.split] |
04:13:02 | | ragu|m quits [*.net *.split] |
04:13:02 | | flashfire42|m quits [*.net *.split] |
04:13:02 | | MinePlayersPEMyNey|m quits [*.net *.split] |
04:13:02 | | Thibaultmol quits [*.net *.split] |
04:13:02 | | nstrom|m quits [*.net *.split] |
04:13:02 | | Hans5958 quits [*.net *.split] |
04:13:02 | | theblazehen|m quits [*.net *.split] |
04:13:02 | | mpeter|m quits [*.net *.split] |
04:13:02 | | rewby|m quits [*.net *.split] |
04:13:02 | | tomodachi94 quits [*.net *.split] |
04:13:02 | | Vokun quits [*.net *.split] |
04:13:02 | | audrooku|m quits [*.net *.split] |
04:13:02 | | britmob|m quits [*.net *.split] |
04:13:02 | | xxia|m quits [*.net *.split] |
04:13:02 | | cmostracker|m quits [*.net *.split] |
04:13:02 | | andrewvieyra|m quits [*.net *.split] |
04:13:02 | | DigitalDragon quits [*.net *.split] |
04:13:02 | | mind_combatant quits [*.net *.split] |
04:13:02 | | @Sanqui|m quits [*.net *.split] |
04:13:02 | | igneousx quits [*.net *.split] |
04:13:02 | | Ajay quits [*.net *.split] |
04:16:50 | | kitonthe1et joins |
04:21:50 | | kitonthe1et quits [Ping timeout: 240 seconds] |
04:34:02 | | kitonthenet joins |
04:40:25 | | kitonthenet quits [Ping timeout: 272 seconds] |
05:35:45 | | Megame quits [Client Quit] |
06:19:13 | | Irene quits [Quit: WeeChat 3.8] |
06:26:49 | | Irenes (ireneista) joins |
06:35:20 | | lflare quits [Ping timeout: 240 seconds] |
06:36:50 | | wickedplayer494 is now authenticated as wickedplayer494 |
06:44:27 | | DogsRNice quits [Read error: Connection reset by peer] |
06:44:32 | | lflare (lflare) joins |
07:04:08 | | hitgrr8 joins |
08:34:52 | | Doran (Doranwen) joins |
08:35:23 | | Doranwen quits [Ping timeout: 272 seconds] |
08:46:53 | | decky_e joins |
08:49:57 | | decky quits [Ping timeout: 272 seconds] |
09:34:53 | | qwertyasdfuiopghjkl quits [Client Quit] |
09:39:49 | | Wohlstand (Wohlstand) joins |
09:48:28 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
10:00:01 | | Bleo1826 quits [Client Quit] |
10:00:26 | | Wohlstand quits [Client Quit] |
10:01:21 | | Bleo1826 joins |
10:08:43 | | Island quits [Read error: Connection reset by peer] |
10:52:54 | | neggles quits [Quit: bye friends - ZNC - https://znc.in] |
11:00:16 | | parfait_ quits [Quit: Leaving] |
11:30:34 | | BornOn420_ quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
11:35:45 | | BornOn420 (BornOn420) joins |
11:39:08 | | decky_e quits [Read error: Connection reset by peer] |
11:39:48 | | decky_e joins |
11:52:18 | | Inti83 joins |
11:53:12 | <Inti83> | Hi, I am here with a request similar to EndOfTerm archive but for Argentina, as the incoming government has already stated it's intent to dismantle most agencies |
11:54:10 | <Inti83> | We are already working on archiving the data by downloading it as wee understand archive.org doesn't necessarily automatically index all pages. We understand ArchiveBot helps with this |
11:54:36 | <Inti83> | The new term starts on 10th of December |
11:55:20 | <Inti83> | We have compiled a list of sites which is not exchaustive |
11:55:36 | <Inti83> | I found argentina.gob.ar and educ.ar in the archive but there are quite a few more that are not |
11:56:33 | <Inti83> | It would be prefereable to have the sites in archive.org rather than just downloading for preservation as this ensures public access to all whereas the distribution aspect after downloading is complex |
12:06:42 | <Inti83> | Some of the content is multimedia and we are having a hard time knowing how to archive it |
12:06:57 | <Inti83> | Example https://www.cont.ar/ |
12:14:41 | <@Sanqui> | Inti83: Hello, please stick around, we're definitely able to help with this. |
12:15:27 | <@Sanqui> | ArchiveBot is able to crawl and download many websites, which then get uploaded to the Internet Archive and become possible to browse in the Wayback Machine |
12:15:35 | <@Sanqui> | It has its limitations though |
12:15:54 | <@Sanqui> | A good start would be to create a page on the wiki with a list of websites, then we can make notes for if individual websites were successful to crawl with ArchiveBot |
12:16:11 | <@Sanqui> | (BTW, I can't access cont.ar at all over here at Europe. I'm getting a Cloudflare block page.) |
12:17:13 | <Inti83> | Hi, cloudflare may be a problem, we encountered some problems with this even at this end |
12:17:32 | <Inti83> | OK, I'll get started on a wiki page |
12:28:18 | | neggles (neggles) joins |
12:33:13 | <Inti83> | What is a good nomeclature for such a page? |
12:38:47 | | Inti83 quits [Ping timeout: 243 seconds] |
12:47:59 | | Inti joins |
12:56:00 | | Inti quits [Remote host closed the connection] |
13:10:20 | | Arcorann quits [Ping timeout: 240 seconds] |
13:13:27 | | Wohlstand (Wohlstand) joins |
13:25:20 | | kiryu quits [Ping timeout: 240 seconds] |
13:27:51 | | kiryu joins |
13:27:51 | | kiryu is now authenticated as kiryu |
13:27:51 | | kiryu quits [Changing host] |
13:27:51 | | kiryu (kiryu) joins |
13:29:49 | | Ketchup901 (Ketchup901) joins |
13:44:27 | | Inti83 joins |
13:45:21 | <Inti83> | Hi, I keep getting disconnected. I wrote earlier about an End Of Term archive for Argentina. I was wondering how to follow nomenclature norms in order to start a new page and add the links as suggested? |
13:45:44 | <Inti83> | There's a Government Backup page but it is US based |
13:48:06 | <thuban> | Inti83: "Argentina" is fine (we have a number of country pages, and can always add other sections/subpages as needed) |
13:48:21 | <Inti83> | ok |
14:03:00 | | eroc1990 quits [Client Quit] |
14:03:24 | | eroc1990 (eroc1990) joins |
14:18:45 | <Inti83> | Hey, OK. I sent the page for review |
14:19:13 | <Inti83> | It has a list of pages we have compiled so far as relevant, although we have issued out a call so will be most likely adding more |
14:20:55 | <Inti83> | I'll likely get disconnected again soon but I will connect again when poss |
14:21:07 | | Inti83 quits [Remote host closed the connection] |
14:33:06 | | Inti83 joins |
14:51:42 | | kitonthenet joins |
14:56:20 | | kitonthenet quits [Ping timeout: 240 seconds] |
15:01:02 | <@rewby|backup> | Inti83: I've approved your page. |
15:01:09 | <Inti83> | Thank you! |
15:01:29 | <h2ibot> | Inti83 created Argentina (+4734, Add cultural links & YT): https://wiki.archiveteam.org/?title=Argentina |
15:05:40 | <Inti83> | Thanks, we are going to test using grab-site to save these. If this works, do we consider the site saved? Or how do you usually proceeed in these cases? This tool: https://github.com/ArchiveTeam/grab-site |
15:28:00 | <TheTechRobo> | Inti83: Generally, WARCs from most people won't get added to the Wayback Machine as there is the possibility of tampering. But if it works with grab-site, it will almost certainly work with ArchiveBot as they share the same crawling code |
15:35:55 | | sepro quits [Ping timeout: 272 seconds] |
15:36:16 | | icedice (icedice) joins |
15:47:46 | | kitonthenet joins |
15:50:20 | | nfriedly quits [Ping timeout: 240 seconds] |
15:58:05 | | Earendil7 quits [Ping timeout: 272 seconds] |
16:10:20 | | kitonthenet quits [Ping timeout: 240 seconds] |
16:23:16 | | Inti83 quits [Remote host closed the connection] |
16:23:47 | | Inti83 joins |
16:23:50 | | sepro (sepro) joins |
16:34:00 | | Earendil7 (Earendil7) joins |
16:38:21 | | kitonthenet joins |
16:42:03 | <Naruyoko> | https://abcnews.go.com/Business/google-begins-process-deleting-inactive-gmail-accounts/story?id=105281283 |
16:42:24 | <Naruyoko> | Have anyone noticed this? Google will start deleting inactive accounts. |
16:49:50 | <Naruyoko> | (I wasn't in #googlecrash, so I can't see history) |
16:51:59 | | nfriedly joins |
16:55:50 | | kitonthenet quits [Ping timeout: 240 seconds] |
17:09:11 | | kitonthe2et joins |
17:14:50 | | kitonthe2et quits [Ping timeout: 240 seconds] |
17:28:01 | | aninternettroll quits [Ping timeout: 272 seconds] |
17:33:43 | | Inti83 quits [Remote host closed the connection] |
17:37:45 | | kitonthe1et joins |
17:41:52 | | aninternettroll (aninternettroll) joins |
17:46:23 | | kitonthe1et quits [Ping timeout: 272 seconds] |
17:47:14 | | kitonthenet joins |
17:55:15 | | kitonthenet quits [Ping timeout: 272 seconds] |
18:03:42 | | inti83 joins |
18:08:09 | | kitonthe1et joins |
18:15:26 | <nulldata> | Naruyoko - Yeah, that is what is prompted the grab for Blogger. #frogger |
18:21:02 | | soap joins |
18:25:15 | <Naruyoko> | I see |
18:26:54 | <soap> | I have a list of ~2000 or so cdn.discordapp.com urls from wiki.tockdom.com, would someone mind adding them to archivebot for me?https://transfer.archivete.am/j5cLW/tockdom_discord_urls.txt |
18:26:55 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/j5cLW/tockdom_discord_urls.txt |
18:27:21 | <soap> | or is there something else I should do with them? |
18:27:49 | <@JAA> | soap: Sure, I'll throw them in. |
18:28:41 | <soap> | thanks! |
18:29:52 | | soap leaves |
18:36:23 | | DogsRNice joins |
18:39:12 | | sec^nd quits [Remote host closed the connection] |
18:39:32 | | sec^nd (second) joins |
19:06:00 | | sec^nd quits [Remote host closed the connection] |
19:06:12 | | sec^nd (second) joins |
19:18:01 | | qwertyasdfuiopghjkl quits [Client Quit] |
19:21:20 | | Wohlstand quits [Ping timeout: 240 seconds] |
19:24:21 | <nicolas17> | JAA: what are the requirements for an archivebot pipeline? |
19:25:18 | <nicolas17> | if there's more .ar sites blocked so they only work from Argentina, it could be a problem |
19:25:30 | <nicolas17> | "I can't access cont.ar at all over here at Europe. I'm getting a Cloudflare block page." |
19:26:33 | <inti83> | cont.ar and cine.ar have user only content which may be why |
19:27:49 | <@JAA> | nicolas17: Right. Stable machine with uptime measured at least in the months. Clean network. For the hardware, SSDs are basically required to operate at an acceptable speed, but otherwise, things can be scaled to fit what's available; the ideal machine would have a good number of CPU cores/threads. |
19:28:21 | <@JAA> | RAM is rarely relevant, but more is better for caching. |
19:28:30 | <nicolas17> | running an archivebot crawler from a .ar IP would help with those cases, I doubt I can offer hardware for that but maybe I (or inti83) can find people who can? |
19:29:08 | <@JAA> | For a more targeted project rather than a general pipeline, the uptime requirement would be less strict, I suppose. |
19:29:24 | <inti83> | is that something like grab-site? |
19:29:27 | <@JAA> | Since we'll want to archive these things within weeks anyway. |
19:29:35 | <inti83> | yes; i think i can find people |
19:29:40 | <inti83> | what do i need to do? |
19:29:41 | <@JAA> | grab-site is essentially a local version of ArchiveBot. |
19:30:56 | <@JAA> | 'Local' as in 'not distributed'; AB has a control node to coordinate the different machines (pipelines). |
19:31:15 | <inti83> | how would we run the archivebot from here? |
19:32:03 | <nicolas17> | I think I know people with servers inside the Cabase IXP :D |
19:33:54 | <inti83> | cool let me know and i can ask people who are on this whether they have the hardware capacity, may be possible |
19:34:21 | <@JAA> | So the AB setup is fairly messy, and the install notes aren't entirely complete I think. If I can get access to a suitable server provided by a trustworthy party, I can set it up. |
19:35:03 | <pokechu22> | I assume this would be set up as a matchonly pipeline (though probably without matchonly in the name so that -p matchonly doesn't hit it), to avoid long-running jobs accidentally ending up on it? |
19:35:48 | <@JAA> | Yes |
20:03:51 | | BlueMaxima joins |
20:14:09 | | riku quits [Quit: WeeChat 4.1.1] |
20:14:30 | | riku (riku) joins |
20:17:20 | <nicolas17> | I just found something interesting for future data-analysis purposes, archive.org has "access-control-allow-origin: *", so you can make client-side JS code to eg. get a cdx file and process it and return the extracted data, and do distributed computing by just giving people a link, kind of like the imgur bruteforce thing :D |
20:45:07 | <inti83> | do you have any tips on archiving atom archives? we are having some trouble: https://share.riseup.net/#G_1seXPsbK1wKVUwdMCNpw |
20:47:30 | | wickedplayer494 quits [Remote host closed the connection] |
20:50:37 | <inti83> | so many links |
20:51:24 | <pokechu22> | That probably needs ignores of some sort but I don't have any specific recomendations |
20:53:28 | <inti83> | yeah, sadly this endpoint is used for everything: it always goes through it :/ |
20:53:52 | <@JAA> | It looks like there is filter faceting, but that might not be the only thing. |
20:57:44 | | IDK (IDK) joins |
21:01:30 | | wickedplayer494 joins |
21:01:40 | | wickedplayer494 is now authenticated as wickedplayer494 |
21:15:16 | | andrew quits [Client Quit] |
21:18:39 | | rewby|m joins |
21:21:50 | | joepie91|m joins |
21:21:50 | | pannekoek11|m joins |
21:21:50 | | MinePlayersPEMyNey|m joins |
21:21:50 | | Sanqui|m (Sanqui) joins |
21:21:50 | | Ruk8 (Ruk8) joins |
21:21:50 | | @ChanServ sets mode: +o Sanqui|m |
21:21:50 | | mikolaj|m joins |
21:21:50 | | thermospheric (Thermospheric) joins |
21:21:50 | | mpeter|m joins |
21:21:50 | | hlgs|m joins |
21:21:50 | | tech234a|m joins |
21:21:50 | | andrewvieyra|m joins |
21:21:51 | | xxia|m joins |
21:21:51 | | theblazehen|m joins |
21:21:51 | | mind_combatant joins |
21:21:51 | | nstrom|m joins |
21:21:51 | | haha-whered-it-go|m joins |
21:21:51 | | saouroun|m joins |
21:21:51 | | t3chler|m joins |
21:21:51 | | jackt1365|m joins |
21:21:51 | | schwarzkatz|m joins |
21:21:51 | | madpro|m joins |
21:21:51 | | x9fff00 (x9fff00) joins |
21:21:51 | | ragu|m joins |
21:21:51 | | britmob|m joins |
21:21:51 | | audrooku|m joins |
21:21:51 | | Ajay joins |
21:21:51 | | igneousx (igneousx) joins |
21:21:51 | | NickS|m joins |
21:21:51 | | akaibu|m joins |
21:21:51 | | masterx244|m joins |
21:21:51 | | Froxcey|m joins |
21:21:51 | | GRBaset (GRBaset) joins |
21:21:51 | | DigitalDragon (DigitalDragon) joins |
21:21:51 | | Minkafighter|m joins |
21:21:51 | | lasdkfj|m joins |
21:21:51 | | finalti|m joins |
21:21:51 | | Thibaultmol joins |
21:21:51 | | manu|m joins |
21:21:51 | | tomodachi94 (tomodachi94) joins |
21:21:52 | | Fletcher (Fletcher) joins |
21:21:52 | | voltagex|m joins |
21:21:52 | | yzqzss (yzqzss) joins |
21:21:52 | | CrispyAlice2 (CrispyAlice2) joins |
21:21:52 | | gwetchen|m joins |
21:21:52 | | Hans5958 (Hans5958) joins |
21:21:52 | | moe-a-m|m joins |
21:21:52 | | wrangle|m joins |
21:21:52 | | vexr joins |
21:21:52 | | superusercode (superusercode) joins |
21:21:52 | | nyuuzyou (nyuuzyou) joins |
21:21:52 | | Max|m12 joins |
21:21:52 | | jwoglom|m joins |
21:21:52 | | Cydog|m joins |
21:21:52 | | nosamu|m joins |
21:21:52 | | Roki_100|m joins |
21:21:52 | | trumad|m joins |
21:21:52 | | Tom|m1 joins |
21:21:52 | | gungagungagunga|m joins |
21:21:52 | | cmostracker|m joins |
21:21:52 | | noobirc|m joins |
21:21:52 | | yetanotherarchiver|m joins |
21:21:52 | | Video joins |
21:21:52 | | Vokun (Vokun) joins |
21:21:52 | | phaeton (phaeton) joins |
21:21:52 | | coro joins |
21:21:52 | | flashfire42|m joins |
21:21:52 | | octylFractal|m joins |
21:21:52 | | qyxojzh|m joins |
21:21:52 | | iCesenberk|m joins |
21:21:53 | | s-crypt|m|m joins |
21:21:53 | | JC|m joins |
21:21:53 | | Exorcism (exorcism) joins |
21:22:03 | | Passiing|m joins |
21:22:03 | | ram|m joins |
21:22:03 | | Peetz0r|m joins |
21:22:04 | | noxious joins |
21:22:04 | | EmeraldSnorlax|m joins |
21:22:04 | | kaz__|m joins |
21:22:04 | | Maakuth|m joins |
21:22:05 | | AntoninDelFabbro|m joins |
21:22:05 | | jevinskie (jevinskie) joins |
21:22:05 | | will|m joins |
21:22:06 | | username675f|m joins |
21:22:20 | | alexshpilkin joins |
21:22:20 | | EvanBoehs|m joins |
21:22:20 | | Misty|m joins |
21:22:20 | | qq44|m joins |
21:22:21 | | hillow596|m joins |
21:22:21 | | sonst-was|m joins |
21:22:21 | | that_lurker|m joins |
21:22:21 | | Nulo|m joins |
21:22:21 | | marius851000 joins |
21:24:22 | | inti83 quits [Remote host closed the connection] |
21:28:03 | | wickedplayer494 quits [Ping timeout: 272 seconds] |
21:34:22 | | andrew (andrew) joins |
21:38:26 | | wickedplayer494 joins |
21:38:35 | | wickedplayer494 is now authenticated as wickedplayer494 |
21:45:30 | | wickedplayer494 quits [Read error: Connection reset by peer] |
21:48:19 | | wickedplayer494 joins |
21:48:29 | | wickedplayer494 is now authenticated as wickedplayer494 |
21:54:52 | | aninternettroll quits [Read error: Connection reset by peer] |
21:54:54 | | aninternettroll (aninternettroll) joins |
22:04:26 | | Island joins |
22:09:17 | | hitgrr8 quits [Client Quit] |
22:49:31 | <AK> | Woot possibly more AB pipelines? |
22:49:33 | | M--mlv|m joins |
23:06:46 | | IDK quits [Client Quit] |
23:21:32 | | Arcorann (Arcorann) joins |
23:51:35 | | Lord_Nightmare quits [Quit: ZNC - http://znc.in] |
23:55:34 | | Lord_Nightmare (Lord_Nightmare) joins |