| 00:02:12 | | midou joins |
| 00:05:12 | | Dada quits [Remote host closed the connection] |
| 00:07:29 | | Hackerpcs quits [Ping timeout: 272 seconds] |
| 00:08:01 | | Hackerpcs (Hackerpcs) joins |
| 00:12:06 | | nepeat quits [Ping timeout: 256 seconds] |
| 00:13:48 | | midou quits [Ping timeout: 256 seconds] |
| 00:16:04 | | mrminemeet quits [Ping timeout: 256 seconds] |
| 00:17:29 | | mrminemeet joins |
| 00:23:29 | <fuzzy80211> | got one box thats been banned for 2 hrs |
| 00:24:05 | | midou joins |
| 00:46:06 | | wickedplayer494 quits [Ping timeout: 256 seconds] |
| 00:47:13 | | nepeat (nepeat) joins |
| 00:47:52 | | wickedplayer494 (wickedplayer494) joins |
| 00:50:08 | | MrMcNuggets quits [Quit: WeeChat 4.3.2] |
| 00:51:44 | | Rejoin_HP_Archivist quits [Quit: Leaving] |
| 00:52:08 | | HP_Archivist (HP_Archivist) joins |
| 00:52:30 | <HP_Archivist> | JAA RE: Greenland. I think now's the time for that if nothing has been done yet. |
| 00:52:40 | <HP_Archivist> | Sorry it's taken me some time to circle back. Busy with classes. |
| 00:53:53 | | MrMcNuggets (MrMcNuggets) joins |
| 00:54:36 | | wickedplayer494 quits [Ping timeout: 256 seconds] |
| 00:55:34 | | wickedplayer494 (wickedplayer494) joins |
| 00:55:37 | <@JAA> | HP_Archivist: I agree. |
| 00:55:41 | <@JAA> | arkiver: All of .gl into #//? |
| 00:56:35 | <@JAA> | Apparently, there are only a few thousand domains. |
| 00:56:59 | <HP_Archivist> | What about cultural institutions that aren't .gov? |
| 00:57:14 | <HP_Archivist> | Or. the equivalent of .gov there |
| 00:57:25 | <@JAA> | We'd need a list. |
| 00:57:56 | <HP_Archivist> | Let me see if I can do some searching and compile some links later tonight |
| 00:58:02 | <@JAA> | The country might be small enough that someone can easily compile that by hand though. |
| 00:58:14 | <h2ibot> | Nyakase edited Tenor (+1389, Document discovery via tenor.com, link to API docs): https://wiki.archiveteam.org/?diff=60153&oldid=60143 |
| 01:05:17 | <eggdrop> | [remind] Guest: maxmodels unban |
| 01:07:45 | <nyakase> | if the api is going to be scraped for tenor too, as mentioned ^ tenor.com hits on the api after the cache. the api key is a query parameter, so just check devtools network tab and you have an official key to use there |
| 01:15:56 | <HP_Archivist> | Noted JAA. I'll have a look in a bit. |
| 01:41:09 | | sec^nd quits [Remote host closed the connection] |
| 01:41:30 | <Guest> | after almost 4hrs still banned on maxmodels |
| 01:41:33 | | sec^nd (second) joins |
| 01:51:17 | <HP_Archivist> | JAA: https://transfer.archivete.am/IAgtE/kalaallit-nunaat-kulturikkut-ingerlatsiviit-domains-2026-01-14.txt |
| 01:51:18 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/IAgtE/kalaallit-nunaat-kulturikkut-ingerlatsiviit-domains-2026-01-14.txt |
| 01:52:10 | <HP_Archivist> | A somewhat exhaustive list of cultural institutions in Greenland. I made sure they were all reachable. They should be good. One or two were blocked, but that could be my school's network. Or a region block. |
| 01:57:53 | | Guest58 quits [Read error: Connection reset by peer] |
| 01:57:57 | | Guest58_ joins |
| 01:58:12 | | Guest58_ quits [Read error: Connection reset by peer] |
| 01:58:44 | | Guest58 joins |
| 02:02:45 | | midou quits [Ping timeout: 272 seconds] |
| 02:19:16 | | Guest58_ joins |
| 02:20:29 | | Guest58 quits [Ping timeout: 272 seconds] |
| 02:20:42 | | Guest58__ joins |
| 02:24:17 | | Guest58_ quits [Ping timeout: 272 seconds] |
| 02:26:10 | | Guest58__ quits [Client Quit] |
| 02:28:31 | <h2ibot> | PaulWise edited Phorge (-2, Bugzilla is solely a bug tracker,…): https://wiki.archiveteam.org/?diff=60154&oldid=60125 |
| 02:33:59 | <klea> | pabs: thanks for that edit :) |
| 02:36:53 | <nicolas17> | maxmodels ETA 23.5 hours (Jan 16 02:05) |
| 02:37:03 | | Guest58 joins |
| 02:41:07 | | Guest58 quits [Read error: Connection reset by peer] |
| 02:55:13 | | midou joins |
| 03:00:23 | | midou quits [Ping timeout: 272 seconds] |
| 03:01:23 | <Guest> | still banned, for 5.5 hours |
| 03:07:11 | <ericgallager> | Some of the links here might be in danger: https://bsky.app/profile/rosemarierung.bsky.social/post/3mcf4kfap6c2f |
| 03:08:51 | <klea> | ericgallager: probably for #UncleSamsArchive ? |
| 03:08:52 | <pokechu22> | https://www.trammellcrow.com/newsroom/tcc-breaks-ground-on-323-750-sf-industrial-development-in-new-hampshire is gone now, while the other links appear to be broken |
| 03:09:20 | <pokechu22> | (existed since december 2023 and previously captured) |
| 03:13:14 | | Webuser708906 joins |
| 03:13:52 | <ericgallager> | klea: idk, kinda borderline... the trammellcrow website links are a private organization... |
| 03:13:57 | <klea> | oh |
| 03:14:02 | <klea> | sorry |
| 03:16:07 | | Webuser708906 quits [Client Quit] |
| 03:16:28 | | midou joins |
| 03:17:22 | <ericgallager> | no it's ok, it really is a borderline case that I wasn't sure about myself... |
| 03:23:49 | | midou quits [Ping timeout: 272 seconds] |
| 03:34:39 | <h2ibot> | Hans5958 edited Maxmodels.pl (+1671, Expand): https://wiki.archiveteam.org/?diff=60155&oldid=60152 |
| 03:36:39 | <h2ibot> | Hans5958 edited Maxmodels.pl (+85, Add more reference): https://wiki.archiveteam.org/?diff=60156&oldid=60155 |
| 03:38:39 | <h2ibot> | Hans5958 edited Bitbucket (+56, cat): https://wiki.archiveteam.org/?diff=60157&oldid=60049 |
| 03:39:32 | <nicolas17> | Hans5958++ |
| 03:39:33 | <eggdrop> | [karma] 'Hans5958' now has 3 karma! |
| 03:39:40 | <h2ibot> | Hans5958 edited Bufftoon (+51, cat): https://wiki.archiveteam.org/?diff=60158&oldid=58684 |
| 03:39:41 | <h2ibot> | Hans5958 edited EyeEm (+30, cat): https://wiki.archiveteam.org/?diff=60159&oldid=59479 |
| 03:40:40 | <h2ibot> | Hans5958 edited Tistory (+65, cat): https://wiki.archiveteam.org/?diff=60160&oldid=59093 |
| 03:40:41 | <h2ibot> | Hans5958 edited Microsoft Update (-2, cat): https://wiki.archiveteam.org/?diff=60161&oldid=58897 |
| 03:40:42 | <h2ibot> | Hans5958 edited Maxmodels.pl (+30, cat): https://wiki.archiveteam.org/?diff=60162&oldid=60156 |
| 03:41:40 | <h2ibot> | Hans5958 edited Meta Ad Library (+30, cat): https://wiki.archiveteam.org/?diff=60163&oldid=59014 |
| 03:42:40 | <h2ibot> | Hans5958 edited SourceForge (+30, cat): https://wiki.archiveteam.org/?diff=60164&oldid=59202 |
| 03:42:41 | <h2ibot> | Hans5958 edited Oshiete! Goo (+31, cat): https://wiki.archiveteam.org/?diff=60165&oldid=57440 |
| 03:42:42 | <h2ibot> | Hans5958 edited Typepad (+31, cat): https://wiki.archiveteam.org/?diff=60166&oldid=57758 |
| 03:42:43 | <h2ibot> | Hans5958 edited Peing (+31, cat): https://wiki.archiveteam.org/?diff=60167&oldid=57445 |
| 03:42:51 | <nicolas17> | meow |
| 03:43:40 | <h2ibot> | Hans5958 edited 짱공유닷컴 (+31, cat): https://wiki.archiveteam.org/?diff=60168&oldid=59035 |
| 03:44:40 | <h2ibot> | Hans5958 edited Clyp (+31, cat): https://wiki.archiveteam.org/?diff=60169&oldid=55909 |
| 03:44:41 | <h2ibot> | Hans5958 edited 竹白 (+31, cat): https://wiki.archiveteam.org/?diff=60170&oldid=55718 |
| 03:44:42 | <h2ibot> | Hans5958 edited SSブログ (+31, cat): https://wiki.archiveteam.org/?diff=60171&oldid=56683 |
| 03:45:40 | <h2ibot> | Hans5958 edited Retrospring (+31, cat): https://wiki.archiveteam.org/?diff=60172&oldid=59722 |
| 03:45:41 | <h2ibot> | Hans5958 edited Posts.cv (+31, cat): https://wiki.archiveteam.org/?diff=60173&oldid=57388 |
| 03:45:42 | <h2ibot> | Hans5958 edited National Archives Catalog (+30, cat): https://wiki.archiveteam.org/?diff=60174&oldid=55885 |
| 03:45:43 | <h2ibot> | Hans5958 edited FC2 (+30, cat): https://wiki.archiveteam.org/?diff=60175&oldid=59033 |
| 03:45:44 | <h2ibot> | Hans5958 edited US Government (+60, cat): https://wiki.archiveteam.org/?diff=60176&oldid=58817 |
| 03:45:45 | <h2ibot> | Hans5958 edited Itch.io (+30, cat): https://wiki.archiveteam.org/?diff=60177&oldid=58782 |
| 03:45:46 | <h2ibot> | Hans5958 edited Glitch (+30, cat): https://wiki.archiveteam.org/?diff=60178&oldid=58344 |
| 03:45:47 | <h2ibot> | Hans5958 edited Livestream (+31, cat): https://wiki.archiveteam.org/?diff=60179&oldid=56839 |
| 03:46:40 | <h2ibot> | Hans5958 edited US Government (-60, cancel cat): https://wiki.archiveteam.org/?diff=60180&oldid=60176 |
| 03:47:25 | | midou joins |
| 03:48:44 | | Guest58 joins |
| 03:51:23 | | Guest58_ joins |
| 03:52:19 | | midou quits [Ping timeout: 272 seconds] |
| 03:53:07 | <klea> | Hans5958: if you wish to add the same category to a list of pages i can also do that in a way that won't spam irc, just give me a wikilike formatted list of pages, and a category name. |
| 03:53:35 | | Guest58 quits [Ping timeout: 272 seconds] |
| 03:53:53 | <klea> | also, we should add a years field or something to <https://wiki.archiveteam.org/index.php/Template:Infobox_project> |
| 03:54:45 | <Hans5958> | To be fair, I'd rather use AWB by myself but I can't seem to make it work |
| 03:55:15 | <klea> | huh? |
| 03:55:17 | <klea> | oh |
| 03:55:24 | <Hans5958> | Maybe the policy is to use a separate bot acc? I could do that, but IDK how the policy goes |
| 03:55:28 | <klea> | yeah if you want to use AWB ask JAA to add the bot flag to your account. |
| 03:55:53 | <klea> | Hans5958: i did it mostly because then i get easy separation between bulk automated edits and hand edits i do. |
| 03:56:28 | <klea> | Hans5958: i've had success in using JWB, if you want i can share some userscripts you could load through an extension external to the wiki. |
| 03:56:55 | <Hans5958> | I see but I still want to keep my edits in this acc as nonbot (and use the collapsing thing on IRC) |
| 03:57:35 | <klea> | then make second account probably |
| 03:57:48 | <Hans5958> | FYI: I'm not sure what happened but I could log in on AWB but editing with it doesn't work (submitting but fails?) |
| 03:59:38 | <klea> | strange |
| 04:00:12 | | wotd joins |
| 04:02:43 | | lflare quits [Quit: Bye] |
| 04:03:27 | | lflare (lflare) joins |
| 04:05:27 | <Hans5958> | !remind Hans5958 1d "Reminder (for everyone): Tell Google that "transfer.archivete.am" is safe on https://safebrowsing.google.com/safebrowsing/report-url" |
| 04:05:43 | <klea> | !remindme |
| 04:05:43 | <eggdrop> | [remind] !remindme "<time>" <message> (time is a format parseable by the TCL command 'clock scan' - https://www.tcl.tk/man/tcl8.6/TclCmd/clock.html) |
| 04:06:13 | <klea> | also, probably don't quote message |
| 04:08:48 | | midou joins |
| 04:10:23 | | Guest58_ quits [Client Quit] |
| 04:11:22 | | Guest58 joins |
| 04:13:12 | | Guest58 quits [Client Quit] |
| 04:14:04 | | Guest58 joins |
| 04:21:03 | | Island quits [Read error: Connection reset by peer] |
| 04:22:25 | | Webuser189333 joins |
| 04:23:10 | | Webuser189333 quits [Client Quit] |
| 04:29:29 | | Guest58_ joins |
| 04:33:29 | | Guest58 quits [Ping timeout: 272 seconds] |
| 04:53:29 | <pabs> | JAA: re the h2ibot wiki change announcements, would it be feasible to make initial page creations use URLs with the initial edit id in them? like https://wiki.archiveteam.org/index.php?title=Tenor&oldid=60142 or https://wiki.archiveteam.org/?title=Tenor&oldid=60142 or https://wiki.archiveteam.org/?oldid=60142 |
| 04:56:38 | | DogsRNice quits [Read error: Connection reset by peer] |
| 05:01:03 | | nexussfan (nexussfan) joins |
| 05:04:31 | | n9nes quits [Ping timeout: 272 seconds] |
| 05:06:07 | | n9nes joins |
| 05:22:42 | | Guest58_ quits [Client Quit] |
| 06:01:44 | | nexussfan quits [Client Quit] |
| 06:04:34 | | atphoenix_ quits [Ping timeout: 256 seconds] |
| 06:07:59 | <h2ibot> | Manu edited Distributed recursive crawls (+74, Candidates: Add russiancouncil.ru): https://wiki.archiveteam.org/?diff=60181&oldid=60067 |
| 06:11:27 | | atphoenix_ (atphoenix) joins |
| 06:40:00 | <nicolas17> | maxmodels ETA 19.5 hours (Jan 16 02:09) |
| 06:40:17 | <nicolas17> | item discovery still sticking to 0.8x |
| 06:41:47 | <@arkiver> | that one is running very smooth |
| 06:47:55 | <nicolas17> | I'm running at concurrency 5 on digitalocean, concurrency 7 at home, didn't ever get banned |
| 06:48:36 | <nicolas17> | I'm getting "There aren't any items available for this project at the moment." sometimes, I guess it's an artifact of moving items between queues? |
| 06:51:08 | <@arkiver> | no, it's rate limited |
| 07:05:05 | <nicolas17> | thought rate limits gave a different error |
| 07:05:29 | <nicolas17> | oh, pattern / item name dependent limits? |
| 07:07:55 | | Webuser14 joins |
| 07:08:36 | <Webuser14> | back to installing warrior, I had to hibernate my PC because in my timezone it was night, anyways now 66% |
| 07:08:51 | <Webuser14> | now waiting for payload to start |
| 07:09:06 | <Webuser14> | and now I should visit localhost |
| 07:09:54 | <Webuser14> | I cannot do it? |
| 07:10:32 | <Webuser14> | the error: |
| 07:10:33 | <Webuser14> | Hmmm… can't reach this page |
| 07:10:33 | <Webuser14> | It looks like the webpage at http://127.0.0.1:1/ might be having issues, or it may have moved permanently to a new web address. |
| 07:10:33 | <Webuser14> | ERR_UNSAFE_PORT |
| 07:10:47 | <nicolas17> | :1 does not look correct |
| 07:11:10 | <nicolas17> | but I don't know what port the warrior uses |
| 07:11:45 | <Webuser14> | oh, it said 8001 |
| 07:12:43 | <nicolas17> | http://127.0.0.1:8001/ should work then |
| 07:12:47 | <Webuser14> | yup |
| 07:13:00 | <Webuser14> | I didn't notice the line |
| 07:14:03 | <Webuser14> | my warrior nickname is webuser14 |
| 07:15:53 | <IDK> | Guest: sorry was asleep, no idea, I released that IP |
| 07:15:56 | <Webuser14> | I also have a wiki account: Webuser14 https://wiki.archiveteam.org/index.php/User:Webuser14 |
| 07:19:03 | <Webuser14> | how to fix "Login error |
| 07:19:03 | <Webuser14> | Incorrect or missing confirmation code." when signing up to just solve the file format problem? |
| 07:21:57 | | midou quits [Ping timeout: 272 seconds] |
| 07:30:49 | | monoxane quits [Ping timeout: 272 seconds] |
| 07:31:37 | | midou joins |
| 07:36:10 | | twiswist quits [Read error: Connection reset by peer] |
| 07:36:21 | | twiswist (twiswist) joins |
| 07:46:46 | | Webuser14 quits [Client Quit] |
| 07:48:30 | | monoxane (monoxane) joins |
| 07:50:51 | | monoxane quits [Read error: Connection reset by peer] |
| 07:51:00 | | monoxane4 (monoxane) joins |
| 07:51:00 | | midou quits [Read error: Connection reset by peer] |
| 07:52:17 | <h2ibot> | PaulWise edited Wikidot (+129, WikiComma): https://wiki.archiveteam.org/?diff=60182&oldid=59461 |
| 08:00:23 | | midou joins |
| 08:02:18 | <h2ibot> | Cruller edited Internet Archive/Save Page Now (+13, /* Blocks */ Add wp-admin): https://wiki.archiveteam.org/?diff=60183&oldid=59066 |
| 08:26:44 | | Guest58 joins |
| 08:27:45 | | AK quits [Quit: AK] |
| 08:58:51 | | rohvani quits [Ping timeout: 272 seconds] |
| 09:10:29 | | Guest58 quits [Client Quit] |
| 09:43:36 | <IDK> | It appears that if you are in europe, the concurrency needs to be much lower |
| 09:44:20 | <IDK> | My workers in west coast US are going perfectly fine with 5 con for over 12 hours |
| 09:44:52 | <@arkiver> | note though that there is not much need to worry about maxmodels, it'll finish well in time. |
| 09:45:00 | <@arkiver> | so don't spend too much energy on it |
| 10:08:13 | | Webuse14 joins |
| 10:08:17 | <Webuse14> | I archived like 3 gb |
| 10:08:29 | <Webuse14> | oops misspelled my name |
| 10:08:41 | | Webuser149 joins |
| 10:08:45 | <Webuser149> | hi |
| 10:08:57 | | Webuser149 quits [Client Quit] |
| 10:09:03 | | Webuse14 quits [Client Quit] |
| 10:09:54 | | Webuser14 joins |
| 10:10:04 | <Webuser14> | so as I said, I archived like 3 gb |
| 10:10:21 | <Webuser14> | How do I make it so it no longer bloats my disk and to upload it to the archive |
| 10:11:55 | | Webuser14 quits [Client Quit] |
| 10:15:57 | | Webuser14 joins |
| 10:19:34 | | Sokar quits [Ping timeout: 256 seconds] |
| 10:21:36 | <masterx244|m> | warrior uploads an item as soon as it is downloaded. if an item disappears from the UI, all data from it has been sent to AT's intermediate infrastructure |
| 10:23:42 | | Sokar joins |
| 10:31:04 | | Webuser14 quits [Client Quit] |
| 10:56:23 | | Dada joins |
| 11:04:53 | | eggdrop quits [Ping timeout: 272 seconds] |
| 11:06:02 | | eggdrop (eggdrop) joins |
| 11:33:07 | | Webuser765058 joins |
| 11:33:13 | | Webuser765058 quits [Client Quit] |
| 11:44:16 | <nstrom|m> | looks like most of my maxmodels workers got banned overnight at con 3, a few still working |
| 11:45:41 | <@arkiver> | nstrom|m: hmm they just did something |
| 11:45:52 | <@arkiver> | dropped from 225 URL/sec to 25 |
| 11:48:41 | <nstrom|m> | my ones that are still working are still working |
| 11:48:46 | <nstrom|m> | they're probably banning manually going off a list of top IPs or something |
| 11:50:20 | <nstrom|m> | still checking my workers via my advanced monitoring system of me manually looking at all the logs one server at a time |
| 11:51:49 | <IDK> | mine too |
| 11:52:03 | <IDK> | not subnet ban, just the IP address |
| 11:52:06 | | Kotomind joins |
| 11:53:03 | <IDK> | felt like a manual ban |
| 11:57:15 | <nstrom|m> | I have 6 workers still alive |
| 12:00:03 | | Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat] |
| 12:02:44 | | Bleo182600722719623455222 joins |
| 12:04:36 | <IDK> | eh, im gonna spin up some more |
| 12:46:11 | | Webuser224299 joins |
| 12:50:48 | | Webuser224299 quits [Client Quit] |
| 12:54:14 | | SootBector quits [Remote host closed the connection] |
| 12:55:21 | | SootBector (SootBector) joins |
| 13:09:28 | | sec^nd quits [Ping timeout: 256 seconds] |
| 13:11:33 | | UwU quits [Ping timeout: 272 seconds] |
| 13:14:28 | | sec^nd (second) joins |
| 13:15:07 | | UwU joins |
| 13:20:09 | | sec^nd quits [Remote host closed the connection] |
| 13:21:14 | | sec^nd (second) joins |
| 13:23:36 | | sec^nd quits [Remote host closed the connection] |
| 13:24:02 | | sec^nd (second) joins |
| 14:20:54 | | nomadgeek (nomadgeek) joins |
| 14:33:16 | | Webuser127665 joins |
| 14:33:28 | | Webuser127665 quits [Client Quit] |
| 14:43:57 | | Guest58 joins |
| 14:48:44 | | midou quits [Ping timeout: 256 seconds] |
| 14:57:53 | | midou joins |
| 15:08:05 | | MrMcNuggets quits [Ping timeout: 272 seconds] |
| 15:15:08 | | MrMcNuggets (MrMcNuggets) joins |
| 15:23:07 | | Dada quits [Remote host closed the connection] |
| 15:27:05 | | Mateon1 quits [Ping timeout: 272 seconds] |
| 15:27:09 | | Mateon2 joins |
| 15:29:21 | | Mateon1 joins |
| 15:32:09 | | Mateon2 quits [Ping timeout: 272 seconds] |
| 15:53:05 | | MrMcNugg1 (MrMcNuggets) joins |
| 15:54:32 | <justauser> | What's a good way to get direct links to videos from Bluesky, ideally the whole batch of post/m3u/mp4? |
| 15:55:36 | | MrMcNuggets quits [Ping timeout: 256 seconds] |
| 16:03:33 | <justauser> | https://tech-ish.com/2026/01/15/massive-piracy-crackdown-1337x-fmovies-and-150-others-facing-global-shutdown-full-list-inside/ |
| 16:04:04 | <justauser> | Probably shouldn't bother with full websites, but doing several captures of their homepages won't hurt. |
| 16:07:45 | <justauser> | https://transfer.archivete.am/gEsC1/20250115_piracy_court_order.txt |
| 16:07:46 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/gEsC1/20250115_piracy_court_order.txt |
| 16:11:26 | | TheTechRobo is now known as TheTechRobo1 |
| 16:11:53 | | TheTechRobo1 is now known as TheTechRobo |
| 16:28:28 | | Kotomind quits [Ping timeout: 256 seconds] |
| 16:34:32 | <h2ibot> | Imer edited Deathwatch (+5, /* 2026-01 */ correct eyeem shutdown date): https://wiki.archiveteam.org/?diff=60184&oldid=60149 |
| 16:45:27 | | nomadgeek quits [Client Quit] |
| 16:55:56 | | BornOn420 quits [Remote host closed the connection] |
| 17:04:47 | | rover joins |
| 17:07:09 | | midou quits [Ping timeout: 272 seconds] |
| 17:11:24 | | Aurora joins |
| 17:11:28 | | BennyOtt quits [Quit: ZNC 1.10.1 - https://znc.in] |
| 17:12:56 | | rover is now authenticated as rover |
| 17:13:01 | <Aurora> | hi i posted about this before but i have all but 10 of the 11.3 million gifs from tenor with legacy ids lying around, i see you guys created a page for it on the wiki, if youre interested id love to upload it somehow |
| 17:13:10 | | BennyOtt (BennyOtt) joins |
| 17:13:51 | <justauser> | Push to IA and share a link, I'd say. |
| 17:13:54 | <szczot3k> | Aurora: please post it using |
| 17:13:58 | <szczot3k> | https://transfer.archivete.am/ |
| 17:14:28 | <Aurora> | yeah ive tried to get it on archive.org but ive been struggling figuring out how to upload 2tb of data, im pretty inexperienced |
| 17:15:05 | <Aurora> | szczot3k will do, thank you |
| 17:15:26 | <justauser> | Do you have a list of links or the icons themselves? |
| 17:15:45 | <szczot3k> | wait |
| 17:15:53 | <justauser> | 2TB is indeed a lot, but if you have links, we can handle downloading then uploading. |
| 17:16:01 | <szczot3k> | ^+1 |
| 17:16:12 | <Aurora> | i have every json attached to the pages, and a list of valid urls |
| 17:16:27 | <justauser> | Which pages? |
| 17:17:18 | <Aurora> | anything with a legacy post url (up to about 20240904, with some extra beyond) |
| 17:18:11 | <justauser> | I'm not sure if we can handle JSONs in a meaningful way, but they are probably not too large. |
| 17:18:18 | <justauser> | So 7z and IA. |
| 17:18:28 | <justauser> | Links here, please. |
| 17:18:56 | <nyakase> | i assume you are referring to the JSONs under store-cache on the pages? those would end up being a part of the final capture regardless |
| 17:19:24 | <Aurora> | nyakase yeah those |
| 17:20:24 | <Aurora> | i checked every url between the ids 0-29,000,000 and 199,000,000-250,000,000 after doing some random checks to determine valid ranges |
| 17:22:32 | <Aurora> | heres the valid links, apologies for every line having .webm at the end, i got it by writing the directory https://transfer.archivete.am/bN4iE/ids.zip |
| 17:22:33 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/bN4iE/ids.zip |
| 17:23:21 | <justauser> | Thanks! |
| 17:23:31 | <justauser> | arkiver: ^ |
| 17:23:56 | <justauser> | Probably too large for AB, but would save us some hassle in DPoS. |
| 17:24:01 | <Aurora> | no problem! |
| 17:25:49 | <Aurora> | i didnt find a way to retrieve the substantial amount of posts without a legacy url as they were phased out at some point, maybe you guys will have better luck than me |
| 17:27:00 | <justauser> | I suggest keeping the data you already downloaded for a while even if not uploading it anywhere, in case we will have some problems. |
| 17:27:18 | <nyakase> | i can't think of ways other than relying on store-cache or paginating the search api. former is probably not ideal since they are capped, latter is not ideal either since a key has to be relied on |
| 17:27:36 | <nyakase> | but since you have a very large amount of store-caches, maybe you could crunch thru them to find ids of non-legacy gifs |
| 17:27:38 | <justauser> | Compressed JSONs can probably go straight to IA. |
| 17:28:32 | <Aurora> | i dont intend on deleting it although im not sure how i could easily be reached as i dont frequent this chat |
| 17:28:38 | <Aurora> | will upload the jsons to IA |
| 17:29:12 | <justauser> | If we have something for you, we will ask bot. |
| 17:29:28 | <justauser> | It will message you should you ever join again. |
| 17:30:01 | <Aurora> | i see i see |
| 17:30:38 | <Aurora> | my list of ids can easily be converted to links by appending them to tenor.com/view/[legacyid] |
| 17:31:43 | <Aurora> | without the .webm, that is |
| 17:36:31 | | Wohlstand (Wohlstand) joins |
| 17:37:15 | | midou joins |
| 17:38:24 | <Aurora> | zipping jsons will likely take a few days |
| 17:39:40 | <justauser> | 7z would compress way better. |
| 17:41:21 | | Wohlstand quits [Ping timeout: 272 seconds] |
| 17:42:08 | <Aurora> | thank you for the tip |
| 17:49:30 | | Juest quits [Ping timeout: 256 seconds] |
| 17:57:48 | <nicolas17> | I'm getting timeouts from maxmodels |
| 17:59:28 | <justauser> | Confirmed. |
| 18:00:35 | | Dada joins |
| 18:00:50 | <nicolas17> | not a ban because there are some =200's in between |
| 18:11:20 | <nicolas17> | or maybe they banned me but the CDN keeps giving me cached data? |
| 18:12:31 | | BornOn420 (BornOn420) joins |
| 18:15:20 | <justauser> | Proxy works, but my server that isn't a Warrior doesn't. |
| 18:15:38 | <justauser> | Perhaps they imported some blocklist? |
| 18:32:21 | <IDK> | https://usercontent.irccloud-cdn.com/file/T3z3aJHU/image.png |
| 18:32:27 | <IDK> | Fuzzy's graph looks interesting |
| 18:32:49 | <IDK> | I'm wondering if you get unbanned and banned instantly if you don't wait for a while |
| 18:52:58 | <fuzzy80211> | no i got banned, started a different group of ips then decided i am going to wait until after business hours to start again |
| 18:53:43 | <fuzzy80211> | whats running now is just a couple hetzner boxes that i didnt care as much if they noticed |
| 18:54:52 | | lennier2_ joins |
| 18:57:30 | | lennier2 quits [Ping timeout: 256 seconds] |
| 19:51:24 | | Juest (Juest) joins |
| 19:59:22 | <nicolas17> | maxmodels ETA 8.8 hours (Jan 16 04:44) |
| 20:34:22 | | malcomind quits [Quit: Connection closed for inactivity] |
| 20:43:15 | | Webuser401633 joins |
| 20:43:44 | | JayEmbee quits [Quit: WeeChat 4.1.1] |
| 20:43:54 | | Webuser401633 quits [Client Quit] |
| 20:57:03 | | BearFortress quits [Ping timeout: 272 seconds] |
| 21:04:13 | <h2ibot> | JustAnotherArchivist edited EyeEm (+148, Add official shutdown notice and note the date…): https://wiki.archiveteam.org/?diff=60185&oldid=60159 |
| 21:06:41 | <datechnoman> | Ban hammer struck all my nodes too :( lame |
| 21:11:32 | | BearFortress joins |
| 21:11:46 | <Guest> | name idea for tenor: #losttenure |
| 21:12:10 | <klea> | seems good |
| 21:12:17 | <klea> | ,join #losttenure |
| 21:12:18 | <eggdrop> | [join] joined #losttenure - set flags +lkarma +transferinliner +seen +8ball |
| 21:17:03 | <Guest> | was this channel created just now? |
| 21:17:15 | <h2ibot> | Nyakase uploaded File:Tenor logo.png: https://wiki.archiveteam.org/?title=File%3ATenor%20logo.png |
| 21:18:39 | <klea> | Guest: yes. |
| 21:21:16 | <Guest> | :D must have been a really good name |
| 21:22:16 | <h2ibot> | Klea edited Tenor (+214, Add Infobox): https://wiki.archiveteam.org/?diff=60187&oldid=60153 |
| 21:24:16 | <h2ibot> | Nyakase uploaded File:Tenor screenshot.png: https://wiki.archiveteam.org/?title=File%3ATenor%20screenshot.png |
| 21:25:16 | <h2ibot> | Nyakase uploaded File:Tenor screenshot.png (crop, that height is probably unsuitable): https://wiki.archiveteam.org/?title=File%3ATenor%20screenshot.png |
| 21:26:16 | <h2ibot> | Klea edited Tenor (+101, Add screenshot): https://wiki.archiveteam.org/?diff=60190&oldid=60187 |
| 21:27:10 | <klea> | i hope nyakase wasn't editing [[Tenor]] and hadn't added other text other than the infobox whilst i edited it. |
| 21:27:57 | | nyakase pouts. |
| 21:28:17 | <nyakase> | but it's okay |
| 21:28:44 | <klea> | sorry i should've waited a reasonable amount of time (at least 30m) before changing it |
| 21:35:41 | <klea> | sec^nd: See <https://wiki.archiveteam.org/index.php/Frequently_Asked_Questions#Why_can't_I_download_the_data_for_some_projects?> as a response to your question on #archiveteam |
| 21:35:42 | <TheTechRobo> | (from #archiveteam) sec^nd: see https://wiki.archiveteam.org/index.php/Frequently_Asked_Questions#Why_can't_I_download_the_data_for_some_projects? |
| 21:35:43 | <TheTechRobo> | tldr: LLM training messed it up :/ |
| 21:35:56 | <klea> | oh, we said it at the same time :p |
| 21:35:58 | <nyakase> | klea: i uploaded it for an infobox, but i was on the edge about adding the irc channel or not, so your edit clarified for me anyway :) |
| 21:36:26 | <sec^nd> | Why are the archives marked as restricted? https://archive.org/download/archiveteam_urls_20251231102426_e958019e |
| 21:36:30 | <klea> | nyakase: i mean, technically it's not "official" until someone with op to #archiveteam decides to set the magic ban extmode |
| 21:36:55 | | MrMcNugg1 quits [Quit: WeeChat 4.3.2] |
| 21:37:06 | | MrMcNuggets (MrMcNuggets) joins |
| 21:37:20 | | MrMcNuggets quits [Client Quit] |
| 21:38:29 | <TheTechRobo> | sec^nd: See above your message in this channel |
| 21:43:34 | | BearFortress quits [Client Quit] |
| 21:49:22 | <sec^nd> | TheTechRobo: Thanks |
| 21:51:20 | <h2ibot> | Nyakase edited Tenor (+622, fill infobox, change date format, add more…): https://wiki.archiveteam.org/?diff=60191&oldid=60190 |
| 21:57:20 | <h2ibot> | Klea edited Tenor (+107, Wikify): https://wiki.archiveteam.org/?diff=60192&oldid=60191 |
| 22:02:49 | | BearFortress joins |
| 22:11:22 | <h2ibot> | Klea edited Tenor (-4, Don't link to [[DPos]] since then it won't get…): https://wiki.archiveteam.org/?diff=60193&oldid=60192 |
| 22:19:38 | <@JAA> | pabs: Yeah, good point on new page announcements. |
| 22:22:04 | | Shjosan quits [Ping timeout: 256 seconds] |
| 22:23:24 | <h2ibot> | Nyakase edited Glitch (+6, unlink DPoS, date format): https://wiki.archiveteam.org/?diff=60194&oldid=60178 |
| 22:24:20 | <klea> | > https://cit.is/ |
| 22:24:47 | <klea> | huh new citations page? |
| 22:25:32 | <nyakase> | Pricing page that has a free tier that says "Artifacts will be hosted on cit.is for at least 3 years. After that, we may ask you to upgrade to preserve older archives." |
| 22:26:39 | <klea> | and the other thing i don't entirely like is: "The backend pairs a standard ArchiveBox instance" <https://sij.law/deepciter/#:~:text=The%20backend,Singlefile%2E> <- ArchiveBox, not to be confused with ArchiveBot, does not make valid warcs, because it uses wget behind the scenes. |
| 22:27:20 | <klea> | > Disclaimer: This is a demo instance only, intended as a public proof-of-concept and a temporary playground. I make no guarantee to host archives you may create on it for any duration. Any sites you archive while using this demo are visible to me and may be visible to others. Please use it with discretion. |
| 22:27:21 | <klea> | huh |
| 22:27:52 | <klea> | i'm trying to find a email for them so i can AFN the forgejo |
| 22:35:12 | <klea> | i hope they configured ArchiveBox to request SPN archivals too. |
| 22:40:53 | | MrMcNuggets (MrMcNuggets) joins |
| 22:43:16 | | Shjosan (Shjosan) joins |
| 22:43:26 | <h2ibot> | Klea edited Discourse/uncategorized (+41, Add CodeFloe - forum.codefloe.com): https://wiki.archiveteam.org/?diff=60195&oldid=59027 |
| 22:52:27 | <h2ibot> | Klea edited Discourse/uncategorized (+63, Add elixirforum.com): https://wiki.archiveteam.org/?diff=60196&oldid=60195 |
| 22:58:15 | | Shjosan quits [Read error: Connection reset by peer] |
| 22:59:11 | | Shjosan (Shjosan) joins |
| 23:07:52 | | pika joins |
| 23:09:00 | | pika quits [Client Quit] |
| 23:14:12 | | sg72 quits [Ping timeout: 256 seconds] |
| 23:16:58 | | atphoenix__ (atphoenix) joins |
| 23:18:28 | | sg72 joins |
| 23:19:18 | | atphoenix_ quits [Ping timeout: 256 seconds] |
| 23:32:03 | | Island joins |
| 23:32:26 | | pika joins |
| 23:33:12 | | nexussfan (nexussfan) joins |
| 23:37:42 | | Wohlstand (Wohlstand) joins |
| 23:42:21 | | Wohlstand quits [Ping timeout: 272 seconds] |
| 23:46:25 | | Shjosan quits [Client Quit] |
| 23:47:26 | | Shjosan (Shjosan) joins |
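
Note on the Tenor ID list discussed between 17:22 and 17:31: Aurora says each line of the shared list ends in `.webm` and that links are formed by appending the legacy ID to tenor.com/view/. A minimal sketch of that conversion follows. It assumes the list has one entry per line, and it assumes the `https://` scheme, which the log does not state. The function name `ids_to_urls` is hypothetical, and the actual contents of `ids.zip` were not inspected.

```python
from typing import Iterable, Iterator

# Assumption: https scheme; the log only gives "tenor.com/view/[legacyid]".
BASE_URL = "https://tenor.com/view/"


def ids_to_urls(lines: Iterable[str], suffix: str = ".webm") -> Iterator[str]:
    """Turn lines such as '12345.webm' into Tenor legacy view URLs."""
    for line in lines:
        name = line.strip()
        if not name:
            # Skip blank lines rather than emitting a bare base URL.
            continue
        if name.endswith(suffix):
            # Drop the trailing '.webm' left over from the directory listing.
            name = name[: -len(suffix)]
        yield BASE_URL + name


if __name__ == "__main__":
    # Made-up sample IDs, for illustration only.
    for url in ids_to_urls(["12345.webm", "67890.webm", ""]):
        print(url)
```

Because the function is a generator, a list of millions of IDs can be streamed from a file and written out line by line without holding everything in memory.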