00:01:23 | | etnguyen03 quits [Client Quit] |
00:18:14 | | ducky quits [Ping timeout: 240 seconds] |
00:23:16 | | etnguyen03 (etnguyen03) joins |
00:27:52 | | lemuria quits [Read error: Connection reset by peer] |
00:28:12 | | lemuria (lemuria) joins |
00:33:14 | | cascode quits [Ping timeout: 240 seconds] |
00:34:33 | | cascode joins |
00:38:58 | | pabs (pabs) joins |
00:41:29 | | cascode quits [Ping timeout: 260 seconds] |
00:46:13 | | cascode joins |
00:51:36 | <h2ibot> | Dango360 edited Itch.io (+12, /* Grab */ Add link to ArchiveBot page): https://wiki.archiveteam.org/?diff=56624&oldid=56623 |
00:52:34 | | cascode quits [Ping timeout: 240 seconds] |
00:55:07 | | cascode joins |
00:56:03 | | dabs quits [Read error: Connection reset by peer] |
01:19:36 | | gatagoto quits [Client Quit] |
01:20:23 | | mls quits [Quit: leaving] |
01:27:43 | | etnguyen03 quits [Client Quit] |
01:28:04 | | etnguyen03 (etnguyen03) joins |
01:37:49 | | etnguyen03 quits [Client Quit] |
01:38:29 | | Wake1 joins |
01:38:39 | | cascode quits [Ping timeout: 260 seconds] |
01:38:55 | | cascode joins |
01:40:54 | | Wake quits [Ping timeout: 240 seconds] |
01:40:54 | | Wake1 is now known as Wake |
01:45:39 | | cascode quits [Ping timeout: 260 seconds] |
01:46:01 | | cascode joins |
01:48:36 | <gamer191-1|m> | <katia> "0.5 means half the workers are..." <- I’m confused, how does that relate to the rate limit? |
01:49:02 | <nicolas17> | the rate limit is not shown anywhere public, no |
01:49:12 | <nicolas17> | it's 110/minute on itch right now |
01:49:56 | <nicolas17> | I think the server supports more complicated limits, like "if the item name starts with X then use a lower limit for those" |
01:54:11 | | etnguyen03 (etnguyen03) joins |
02:10:56 | | lemuria_ (lemuria) joins |
02:11:54 | | lemuria quits [Ping timeout: 240 seconds] |
02:21:24 | | lemuria (lemuria) joins |
02:23:54 | | etnguyen03 quits [Client Quit] |
02:24:14 | | lemuria_ quits [Ping timeout: 240 seconds] |
02:25:35 | | etnguyen03 (etnguyen03) joins |
02:25:54 | | lemuria quits [Ping timeout: 240 seconds] |
02:27:12 | | lemuria (lemuria) joins |
02:28:05 | <TheTechRobo> | gamer191-1|m: It lets you see which projects need more help. If a project has a low item request serve rate, your worker probably won't get any items at that point in time. |
02:31:34 | | lemuria quits [Ping timeout: 240 seconds] |
02:32:12 | | lemuria (lemuria) joins |
02:37:14 | | etnguyen03 quits [Remote host closed the connection] |
02:38:08 | | Mateon2 joins |
02:38:34 | | Mateon1 quits [Ping timeout: 240 seconds] |
02:38:45 | | Mateon1 joins |
02:40:25 | | Webuser023577 joins |
02:42:15 | <Webuser023577> | Hello I am curious if there is some way to find a user's ask.fm posts from the ask.fm archive |
02:42:49 | | Mateon2 quits [Ping timeout: 260 seconds] |
02:43:22 | | Webuser023577 quits [Client Quit] |
02:51:08 | | lemuria quits [Read error: Connection reset by peer] |
02:51:45 | | lemuria (lemuria) joins |
02:54:05 | | tzt quits [Remote host closed the connection] |
02:54:25 | | tzt (tzt) joins |
02:56:14 | | lemuria quits [Ping timeout: 240 seconds] |
02:57:31 | | lemuria (lemuria) joins |
02:58:07 | | lemuria quits [Read error: Connection reset by peer] |
03:07:31 | | lemuria (lemuria) joins |
03:11:34 | | lemuria quits [Ping timeout: 240 seconds] |
03:33:03 | | lemuria (lemuria) joins |
03:38:49 | | lemuria quits [Ping timeout: 260 seconds] |
03:42:28 | <gamer191-1|m> | <TheTechRobo> "gamer191-1: It lets you see..." <- Oh, that makes sense |
03:42:28 | <gamer191-1|m> | Thanks TheTechRobo |
03:43:50 | | lemuria (lemuria) joins |
03:45:28 | | Webuser743529 joins |
03:45:51 | | chrismeller8 quits [Quit: Ping timeout (120 seconds)] |
03:45:55 | | TheTechRobo quits [Quit: Ping timeout (120 seconds)] |
03:46:32 | <gamer191-1|m> | !tell Webuser023577 couldn’t you just use web.archive.org? |
03:46:33 | <eggdrop> | [tell] ok, I'll tell Webuser023577 when they join next |
03:46:50 | | wyatt8740 quits [Quit: ZNC got killed or something else has gone wrong, probably.] |
03:47:06 | | yasomimi (yasomi) joins |
03:47:49 | | ScenarioPlanet quits [Quit: Ping timeout (120 seconds)] |
03:47:50 | | Pedrosso quits [Quit: Ping timeout (120 seconds)] |
03:48:06 | | wyatt8740 joins |
03:48:10 | <Webuser743529> | !tell gamer191-1|m I am looking for https://ask.fm/a/beh4mfid which is from user 108 I believe and it's not archived it goes to a redirect |
03:48:11 | <eggdrop> | [tell] gamer191-1|m is here - they should see your message :) |
03:48:12 | | ScenarioPlanet (ScenarioPlanet) joins |
03:48:28 | | Soulflare quits [Quit: http://drsclan.net] |
03:48:43 | <Webuser743529> | I have no clue how to use IRC I tried to install mirc and I accidently put myself in effnet instead of hackint.org |
03:48:44 | | tzt quits [Ping timeout: 260 seconds] |
03:48:44 | | yasomi quits [Ping timeout: 260 seconds] |
03:48:44 | | yasomimi is now known as yasomi |
03:49:34 | | tzt (tzt) joins |
03:49:54 | | d10n quits [Ping timeout: 260 seconds] |
03:50:24 | | Soulflare joins |
03:51:04 | <@OrIdow6> | It looks like it wasn't saved much? |
03:53:19 | <Webuser743529> | !tell Orldow6 I am an idiot the user is actually cvii which is on internet archive but there isn't anything there. It's from 2024 and looks to be that everything was deleted. So I think looking at the databases could be helpful if they are not from 2024 or maybe there are other ask.fm archives that are from earlier than 2024 |
03:53:20 | <eggdrop> | [tell] ok, I'll tell Orldow6 when they join next |
03:53:40 | | Webuser175935 joins |
03:54:19 | | Webuser175935 quits [Client Quit] |
03:55:07 | | Webuser743529 quits [Client Quit] |
03:55:09 | | Church quits [Ping timeout: 260 seconds] |
03:55:11 | <@OrIdow6> | Webuser743529: I'm talking about ask.fm generally, I wasn't involved in that project |
03:55:45 | <@OrIdow6> | Oh |
03:56:11 | <@OrIdow6> | Anyway it would be nice if someone involved would update the wiki a bit on how much "partially saved" entails/a bit of an after action report on the project |
03:56:20 | | d10n joins |
03:57:11 | | Webuser086390 joins |
03:57:50 | | Webuser086390 quits [Client Quit] |
04:01:39 | <gamer191-1|m> | <Webuser743529> "I have no clue how to use IRC..." <- Matrix (https://matrix.to/#/%23archiveteam-bs:hackint.org) is way easier |
04:02:28 | <nicolas17> | matrix doesn't let you see that they already left :P |
04:03:45 | <gamer191-1|m> | Oh, it usually does |
04:10:47 | <gamer191-1|m> | Speaking of Matrix, does anyone know why #scratchtheitch isn’t bridged? (You can join that channel on Matrix, but it’s not bridged because the list of users isn’t synchronised and messages aren’t transferred) |
04:21:48 | | Church (Church) joins |
04:41:28 | | lemuria quits [Read error: Connection reset by peer] |
04:48:59 | | lemuria (lemuria) joins |
04:51:10 | | awauwa (awauwa) joins |
04:51:59 | | lemuria quits [Read error: Connection reset by peer] |
04:52:03 | | Wake8 joins |
04:53:54 | | Wake quits [Ping timeout: 240 seconds] |
04:53:54 | | Wake8 is now known as Wake |
04:53:59 | | lemuria (lemuria) joins |
05:18:34 | | cascode quits [Ping timeout: 260 seconds] |
05:19:25 | | cascode joins |
05:20:58 | | fuzzy8021 quits [Read error: Connection reset by peer] |
05:23:19 | | fuzzy80211 (fuzzy80211) joins |
05:23:34 | | cascode quits [Ping timeout: 240 seconds] |
05:23:59 | | cascode joins |
05:33:27 | | SDRedneck quits [Quit: Lost terminal] |
05:37:49 | | cascode quits [Ping timeout: 260 seconds] |
05:38:15 | | cascode joins |
05:44:14 | | archiveDrill quits [Ping timeout: 260 seconds] |
05:44:54 | | cascode quits [Ping timeout: 240 seconds] |
05:44:59 | | cascode joins |
05:47:12 | | archiveDrill joins |
05:51:49 | | cascode quits [Ping timeout: 260 seconds] |
05:52:26 | | cascode joins |
06:03:48 | | Juesto (Juest) joins |
06:06:54 | | Juest quits [Ping timeout: 240 seconds] |
06:06:54 | | Juesto is now known as Juest |
06:08:39 | | fuzzy8021 (fuzzy80211) joins |
06:09:54 | | cascode quits [Ping timeout: 260 seconds] |
06:10:55 | | cascode joins |
06:11:04 | | fuzzy80211 quits [Ping timeout: 260 seconds] |
06:13:54 | | BearFortress quits [Ping timeout: 240 seconds] |
06:18:25 | | cuphead2527480 (Cuphead2527480) joins |
06:21:34 | | DopefishJustin quits [Ping timeout: 260 seconds] |
06:21:37 | | DopefishJustin joins |
06:21:37 | | DopefishJustin is now authenticated as DopefishJustin |
06:26:34 | | cascode quits [Read error: Connection reset by peer] |
06:26:45 | | cascode joins |
06:30:46 | | notSokar joins |
06:33:14 | | Sokar quits [Ping timeout: 260 seconds] |
06:54:51 | | ArchivalEfforts quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
06:55:00 | | ArchivalEfforts joins |
07:22:28 | | Guest58 joins |
07:26:47 | <h2ibot> | That lurker edited Archiveteam:IRC/Relay (+79): https://wiki.archiveteam.org/?diff=56625&oldid=55613 |
07:35:31 | <BlankEclair> | finally, a table that's not class="wikitable" |
07:36:02 | | hexagonwin|m joins |
08:00:24 | | ducky (ducky) joins |
08:22:52 | | BearFortress joins |
08:28:14 | | cuphead2527480 quits [Client Quit] |
08:52:26 | | pie_ (pie_) joins |
08:59:37 | | APOLLO03 joins |
09:10:48 | | BennyOtt_ joins |
09:11:54 | | BennyOtt quits [Ping timeout: 260 seconds] |
09:11:54 | | BennyOtt_ is now known as BennyOtt |
09:11:54 | | BennyOtt is now authenticated as BennyOtt |
09:12:04 | | Dada joins |
09:19:03 | | @arkiver is fully back |
09:19:54 | | Juest quits [Ping timeout: 240 seconds] |
09:20:16 | | Juest (Juest) joins |
09:27:46 | | beastbg8 quits [Read error: Connection reset by peer] |
10:15:51 | | that_lurker is fully front |
10:18:27 | | BlankEclair is fully fronting |
11:00:03 | | Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat] |
11:02:46 | | Bleo182600722719623455222 joins |
11:07:37 | | Jonboy345 joins |
11:19:27 | | beastbg8 (beastbg8) joins |
11:36:07 | <@JAA> | That was the second !tell Orldow6 within 24 hours. lol |
11:36:24 | | TheTechRobo (TheTechRobo) joins |
11:37:51 | <that_lurker> | !tell JÅÅ now we just need to wait for someone with that nick to join and be confused :-P |
11:37:52 | <eggdrop> | [tell] ok, I'll tell JÅÅ when they join next |
11:46:21 | | beastbg8_ joins |
11:48:34 | | nstrom joins |
11:49:24 | | beastbg8 quits [Ping timeout: 260 seconds] |
11:50:52 | | manu|m leaves [User left] |
12:03:17 | | Czechball2 joins |
12:05:09 | | Czechball quits [Ping timeout: 260 seconds] |
12:05:09 | | Czechball2 is now known as Czechball |
12:16:34 | | TheEnbyperor_ quits [Ping timeout: 240 seconds] |
12:16:49 | | TheEnbyperor quits [Ping timeout: 260 seconds] |
12:28:14 | <gamer191-1|m> | <JAA> "That was the second !tell Orldow..." <- It’s cause their name has a capital i which is confusing |
12:31:34 | | BornOn420 (BornOn420) joins |
12:39:50 | <@JAA> | bad fonts-- |
12:39:51 | <eggdrop> | [karma] 'bad fonts' now has -1 karma! |
12:40:22 | | sec^nd (second) joins |
12:50:16 | | mls (mls) joins |
12:50:57 | | TheEnbyperor joins |
12:51:07 | <that_lurker> | Comic Sans++ |
12:51:09 | <eggdrop> | [karma] 'Comic Sans' now has 1 karma! |
12:52:23 | | TheEnbyperor_ (TheEnbyperor) joins |
13:00:45 | | eggdrop quits [Remote host closed the connection] |
13:02:30 | | SootBector (SootBector) joins |
13:03:56 | <masterx244|m> | mixes a | into the package |
13:04:23 | <cruller> | I've listed the URLs for these 692 videos, so could someone please submit this to AB? https://transfer.archivete.am/inline/12UUzP/nhklearning_videos.txt |
13:10:33 | | eggdrop (eggdrop) joins |
13:29:32 | | nstrom quits [Client Quit] |
13:31:56 | <masterx244|m> | did eggdrop eat a cockroach? |
13:50:19 | <that_lurker> | !8ball did eggdrop eat a cockroach |
13:50:19 | <eggdrop> | 🎱: that_lurker, outlook not so good |
14:07:43 | <@imer> | oh no |
14:13:51 | <@OrIdow6> | JAA: Admittedly, because they tend to be done by people new to IRC, when I glance at them my first thought is sometimes that it's someone who believes they need to use the !tell command to send all their messages |
14:14:17 | <@OrIdow6> | Did you ever figure out why the channel wasn't bridget gamer191-1|m? |
14:15:58 | <@JAA> | Yeah |
14:16:13 | <@JAA> | It's probably a bridge bug and has been brought up in #hackint. |
14:16:53 | | hexa|m (hexa-) joins |
14:18:04 | <@OrIdow6> | Ah, and here's hackint themselves |
14:25:16 | <hexa|m> | uh-hu |
14:25:18 | <hexa_> | h |
14:25:35 | <hexa_> | I mean, it's super slow, but it works? |
14:26:06 | <hexa_> | and by super slow I mean in the region of 10s delay |
14:26:09 | <hexa_> | on good days |
14:30:24 | | TheEnbyperor_ quits [Ping timeout: 260 seconds] |
14:30:25 | <@imer> | hexa_: not seeing any matrix people in #scratchtheitch - that might be broken still? |
14:30:54 | | TheEnbyperor quits [Ping timeout: 240 seconds] |
14:31:26 | | hexa|m leaves [User left] |
14:36:00 | <masterx244|m> | and no messages from IRC coming over at all into the matrix end |
14:36:39 | <hexa_> | wworks for me |
14:36:47 | <hexa_> | but note that I sometimes use !join in the appservice DM |
14:37:04 | <hexa_> | if it fails then that gets me better feedback |
14:39:57 | <masterx244|m> | quirky trap that a room with the same name existed on the network thats linkable but ends up being a dead end. |
14:43:45 | <hexa_> | huh |
15:06:39 | | TheEnbyperor (TheEnbyperor) joins |
15:09:15 | | TheEnbyperor_ joins |
15:29:03 | | Shjosan quits [Quit: Am sleepy (-, – )…zzzZZZ] |
15:30:04 | | Shjosan (Shjosan) joins |
15:34:41 | | fuzzy8021 is now known as fuzzy80211 |
15:43:26 | <gamer191-1|m> | <masterx244|m> "quirky trap that a room with the..." <- Would it be easiest to just switch to a different channel name (CC OrIdow6) |
15:46:45 | <@JAA> | Meh |
15:46:50 | <@arkiver> | i'm not sure if we should switch cannels becaue of bridge problems |
15:46:53 | <@JAA> | ^ |
15:47:00 | <@arkiver> | i don't think we should |
15:47:53 | <@OrIdow6> | Huh |
15:47:56 | <@arkiver> | this may be good time to start using IRC directly :) |
15:48:13 | <@OrIdow6> | Out of curiosity what network is that? Matrix.org? I don't understand how the bridge works |
15:48:42 | <@OrIdow6> | I would've thought the Matrix server was hackint.org |
15:50:04 | <@OrIdow6> | arkiver: I'd tenuously support switching it over if it's the only option, but I don't like it, that's trading one set of inconveniences for another |
15:51:15 | <@imer> | not a clue how the matrix stuff works either, figured it was just broken, but apparently sorta works? |
15:51:33 | <@JAA> | 60% of the time, it works every time! |
15:52:19 | <@arkiver> | OrIdow6: we're on IRC mainly, not another platform, though there is *something* on discord i believe. i would not support adjusting IRC because of a buggy tool |
15:53:02 | <@arkiver> | i guess whoever handles the bridge should fix what needs fixing, and/or it may be a good opportunity to start using IRC directly |
15:53:49 | <@JAA> | Agreed |
15:54:14 | <gamer191-1|m> | <arkiver> "this may be good time to start..." <- But then I miss all the messages overnight, unless I leave my computer on |
15:55:31 | <gamer191-1|m> | <OrIdow6> "I would've thought the Matrix..." <- It is |
15:55:31 | <gamer191-1|m> | But similar to Mastodon, Matrix allows you to create an account on any server and then use it on any other server |
15:56:01 | <@arkiver> | matrix should fix that then! |
15:56:20 | <@arkiver> | gamer191-1|m: perhaps you could look into connecting through a machine that is always on |
15:56:43 | <@arkiver> | at Hetzner you could rent something for a few bucks per month, but you'd need to set it up |
15:56:51 | <@arkiver> | maybe someone has a good guide (should we make one if not?) |
15:57:31 | <gamer191-1|m> | arkiver: Fix what? I'm confused |
15:57:46 | <@OrIdow6> | arkiver: I don't think I'd support a Discord bridge but I want to be pragmatic with this, we throw out "use Matrix if you have trouble" a lot, and looking at the sidebar of this room a lot of people end up using it, I'd hate to deny them access |
15:58:03 | <@JAA> | > we throw out "use Matrix if you have trouble" a lot |
15:58:09 | <@JAA> | Do we? I don't think we've ever endorsed it before. |
15:58:10 | <@OrIdow6> | Setting up a bouncer is a big effort for someone who doesn't know what a folder is |
15:58:29 | <@OrIdow6> | (Not saying that gamer... is, just that this happens) |
15:59:41 | <@OrIdow6> | JAA: Huh, I guess we haven't, I thought it was mentioned in the wiki |
15:59:56 | <@JAA> | The wiki doesn't mention Matrix anywhere either as far as I can tell. The discussion about moving from EFnet to hackint does have this gem though: |
15:59:59 | <@JAA> | > Here's hoping for Matrix to mature in the coming months |
16:00:02 | <@OrIdow6> | It goes get thrown around a lot tho and again, I just want to be pragmatic |
16:00:02 | <@JAA> | That was added in 2021. :-) |
16:00:04 | <@arkiver> | i have never recommended matrix either, maybe it's on the wiki in which case a warning/note should be added |
16:00:20 | <gamer191-1|m> | arkiver: Nah, I could easily figure it out (although I'm not gonna cause it's a waste of a server and I can just communicate using eggdrop) |
16:00:30 | <@OrIdow6> | gamer191-1|m: But if you join like #scratchtheitch:hackint.org , that doesn't work? |
16:00:55 | <@JAA> | At least one Matrix user is in the channel, so it should be possible to join. |
16:01:02 | <@OrIdow6> | It sounds as if all channels in Matrix are in one huge global namespace, but I thought it should be by-server |
16:01:08 | <gamer191-1|m> | OrIdow6: No, cause there's some weird issue with that specific channel |
16:01:22 | <Vokun> | I have no idea how i came across matrix, but the ease of setting up made me deal with all the bugs till recently |
16:01:28 | <Vokun> | Now i'm questioning whether I want to keep it |
16:01:45 | <gamer191-1|m> | JAA: Oh |
16:01:45 | <gamer191-1|m> | masterx244 how did you do that? |
16:01:59 | <gamer191-1|m> | OrIdow6: It is by-server |
16:02:06 | <@imer> | gamer191-1|m: hexa managed to join earlier, so it's possible to somehow |
16:03:16 | <@arkiver> | gamer191-1|m: i would say it's not a waste of a server, because Archive Team is worth it :) |
16:03:28 | <@imer> | "<hexa_> but note that I sometimes use !join in the appservice DM" not sure what that means, but maybe that helps |
16:05:18 | <gamer191-1|m> | imer: Oh wow, that does work |
16:05:23 | <@OrIdow6> | It'd be nice if we had some kind of bouncer to send other people to |
16:05:29 | <@OrIdow6> | *new people |
16:05:34 | <@OrIdow6> | Maybe that can only join AT channels |
16:06:03 | <gamer191-1|m> | For anyone else, if you wanna access #scratchtheitch:hackint.org on Matrix DM appservice-irc with the text "!join #scratchtheitch:hackint.org" |
16:07:54 | <Vokun> | Been trying different versions of that since I saw them mention it, but it asks for a certain format. '!join irc.example.com #channel [key]' |
16:08:38 | <gamer191-1|m> | Nope, I just sent !join #scratchtheitch |
16:09:18 | <Vokun> | Interesting. it works fine when I copy paste what you put, but not when i typed it |
16:10:27 | <@arkiver> | and there we almost moved the channel to something else due to a buggy matrix problem |
16:11:39 | <ineffyble> | Something that's a bit easier to do than a bouncer is Quassel |
16:11:52 | <ineffyble> | For individual users I mean, not hosting a shared one |
16:12:06 | <gamer191-1|m> | Thanks everyone for the help btw |
16:13:15 | <h2ibot> | JustAGrook edited Talk:Goo.gl (+225): https://wiki.archiveteam.org/?diff=56627&oldid=56603 |
16:14:15 | <h2ibot> | JustAGrook created Hungary (+181, Created page with "{{Infobox Country…): https://wiki.archiveteam.org/?title=Hungary |
16:16:34 | | Guest58 quits [Quit: My Mac has gone to sleep. ZZZzzz…] |
16:24:39 | <@arkiver> | looking into sketch.piciv.net for the upcoming deadline next month |
16:24:46 | <@arkiver> | goo.gl is already covered |
16:25:07 | <@arkiver> | "sjwaampop" put in ArchiveBot |
16:26:33 | <@arkiver> | blog.goo.ne.jp will start early due to size, same channel as the previous goo stuff, #itsgoone |
16:27:17 | <@arkiver> | same for oshiete.goo.ne.jp , seems to be ~14 million sequentially numbered questions. |
16:27:59 | <@arkiver> | ranking.goo.ne.jp seems small, putting it in ArchiveBot |
16:30:28 | <@arkiver> | (it's already running - thanks Exorcism ) |
16:35:28 | <@JAA> | sketch.pixiv.net* and I guess we can reuse #pixeled for that. |
16:36:09 | <@arkiver> | yes let's reuse that |
16:37:08 | <@arkiver> | deathwatch is actually looking a bit empty towards the end of the year, first half was very full |
16:38:14 | | grill (grill) joins |
16:38:22 | | Wohlstand (Wohlstand) joins |
16:43:29 | <@imer> | ssshh :p |
16:44:16 | <nicolas17> | ^ |
16:44:40 | <@arkiver> | :P |
16:46:34 | <hexagonwin_> | i've tried to write a proper wget-lua based scraper on my own, but I can't really understand how lol. For now I've written down some information required for scraping this site here <http://p.z80.kr/tistory_archiveteam.html>. Maybe should I create a wiki page? Any advice is appreciated.. |
16:50:49 | <@arkiver> | hexagonwin_: nice detailed page! you mention the sequential ID for each blog, did you find a way to get the blog URL given a sequential ID? |
16:51:20 | <hexagonwin_> | arkiver: thanks for the reply. it's not a sequential ID for each blog, it's for each post and that's not really important since it's all on sitemap.xml anyway. |
16:51:52 | <@arkiver> | if there's not too many and the homepages can be easily scanned, it may be good to do a periodic scan over all blog and check which ones are nearing the 5 year mark, and get those into ArchiveBot |
16:52:20 | <@arkiver> | hexagonwin_: i see, so the sequential IDs are per-site instances, instead of a shared sequential ID across all sites? |
16:52:40 | <hexagonwin_> | it's hard to know which blogs are hitting the 5 year mark, some blogs are not updated for more than a decade (at least publicly) yet still remain. |
16:53:02 | <@arkiver> | those should be archived then, we could just check the date of the latest post on the front page |
16:53:05 | <hexagonwin_> | i'm not sure what you mean by the sequential post id being per-site? |
16:53:43 | <hexagonwin_> | also, is archivebot capable of archiving things loaded by js or modifying image url to get the best version? |
16:54:04 | <@arkiver> | i'm talking about |
16:54:07 | <@arkiver> | > Though, the numeric ID (110838 for cdmanii) can be found from the mobile page |
16:54:21 | <@arkiver> | is that a numeric ID for the entire blog? or indeed per post? |
16:54:28 | <hexagonwin_> | that's the ID for that blog. |
16:54:33 | <@arkiver> | ah |
16:54:41 | <@arkiver> | is there a way to get the blog when you have only the ID? |
16:54:47 | <@arkiver> | (i did not check yet myself) |
16:54:51 | <hexagonwin_> | I'm not aware of any way of getting the blog username from that ID. |
16:55:12 | <hexagonwin_> | tbh i think i've already found enough blog usernames actually |
16:56:37 | <hexagonwin_> | just checked now and it's 661133 usernames, not verified. i'm keep running the checker script, 142676 of 652791 done right now and the valid:invalid ratio is about 3:1 |
16:57:21 | <hexagonwin_> | assuming that's all right we get a list of about 0.5M actually existing blogs |
16:58:59 | <@arkiver> | hexagonwin_: perhaps an old project like the one for retrospring-grab https://github.com/ArchiveTeam/retrospring-grab/blob/master/retrospring.lua can be interesting for wget-lua code |
16:59:14 | | Juest quits [Ping timeout: 240 seconds] |
17:01:56 | | Juest (Juest) joins |
17:02:09 | <hexagonwin_> | arkiver: thanks. may i ask how i can test-run those scripts? i've tried running some after editing with --lua-script but I get errors related to not being able to load the utf8 module.. is there some recommended environment to test them? should i install warrior and get shell on it? |
17:03:41 | <@arkiver> | there are several Lua dependencies, of course |
17:05:43 | | i_have_n0_idea37 quits [Quit: The Lounge - https://thelounge.chat] |
17:06:02 | | i_have_n0_idea37 (i_have_n0_idea) joins |
17:06:59 | <hexagonwin_> | i've searched and some says utf8 is included with lua5.3, tried installing https://github.com/starwing/luautf8 anyway but doesn't work. i'm actually not even sure which lua wget-at is using lol (my system seems to have 5.1, 5.3 and luajit2.1) |
17:07:03 | <@arkiver> | those currently being installed and available are https://github.com/ArchiveTeam/grab-base-df/blob/master/rocks/requirements-0-0.rockspec#L11-L23 |
17:07:17 | <@arkiver> | you need 5.1 |
17:08:33 | <hexagonwin_> | my wget-at --version seems to indicate it's linked with luajit, is it built wrong? https://termbin.com/z3c1 |
17:10:44 | <@arkiver> | that would be libluajit-5.1 , basically the luajit version of lua5.1 |
17:16:26 | <@arkiver> | hexagonwin_: i will be off for ~9 hours. on your question of AB, js and custom image URLs, generally no, but please provide examples of this js and the images. |
17:16:39 | | shinon71 quits [Ping timeout: 260 seconds] |
17:17:12 | <@arkiver> | in the case of the images you could get a list of their arhcived URL afterwards from the ArchiveBot CDX on IA, generate new image URLs from that and archive those |
17:17:23 | <@arkiver> | small custom scripts may also be something i want to support in #Y |
17:17:33 | <@arkiver> | more on that tomorrow |
17:18:05 | <hexagonwin_> | arkiver: for js i meant comments/guestbook and images i've already written on that page. anyway, thanks :) |
17:24:36 | | awauwa quits [Quit: awauwa] |
17:29:50 | | shinon71 joins |
17:51:14 | | grill quits [Ping timeout: 240 seconds] |
17:52:16 | | grill (grill) joins |
18:31:52 | | BearFortress_ joins |
18:32:01 | | BearFortress_ quits [Client Quit] |
18:35:24 | | BearFortress quits [Ping timeout: 260 seconds] |
18:52:39 | <h2ibot> | OrIdow6 edited Itch.io (+130, /* Site structure notes */ Downloadableness): https://wiki.archiveteam.org/?diff=56629&oldid=56624 |
18:57:07 | | Island joins |
19:05:00 | <gamer191-1|m> | Don’t think so (I remember that being an issue for the usgovernment project) |
19:06:17 | <gamer191-1|m> | Stupid Mattix😡. Anyway, that was a reply to nicolas17 `I think the server supports more complicated limits, like "if the item name starts with X then use a lower limit for those"` |
19:09:49 | | grill quits [Ping timeout: 260 seconds] |
19:21:52 | <@JAA> | Replies don't render well on the IRC side anyway. It's better to reference what you're replying to. |
19:22:11 | <@JAA> | And yes, the tracker does support such things, it just doesn't scale well, which is why we didn't use it on that project. |
19:22:35 | | Wake8 joins |
19:25:54 | | Wake quits [Ping timeout: 240 seconds] |
19:25:54 | | Wake8 is now known as Wake |
19:41:12 | | BearFortress joins |
19:49:25 | | dabs joins |
20:22:19 | | dabs quits [Client Quit] |
20:31:41 | | hyenatown quits [Quit: Konversation terminated!] |
20:36:09 | | cascode quits [Ping timeout: 260 seconds] |
20:36:54 | | andrewnyr quits [Quit: The Lounge - https://thelounge.chat] |
20:39:49 | | andrewnyr joins |
21:13:59 | | linuxgemini quits [Read error: Connection reset by peer] |
21:14:03 | | linuxgemini3 (linuxgemini) joins |
21:38:13 | | etnguyen03 (etnguyen03) joins |
21:56:42 | | etnguyen03 quits [Client Quit] |
21:58:14 | | etnguyen03 (etnguyen03) joins |
22:14:14 | | Dada quits [Ping timeout: 240 seconds] |
22:24:19 | | Webuser566891 joins |
22:26:04 | <Webuser566891> | spanish webforum ejde.net is closing down at the end of the month. After 4 years and 4 months, but it was the sucessor of zonaforo meristation, which was created decades ago. Maybe it should be saved |
22:30:02 | <pokechu22> | I'll start an archivebot job. unfortunately https://www.ejde.net/core/xmlsitemap.php (from robots.txt) seems to be broken :/ |
22:30:41 | <pokechu22> | ugh, they're using cloudflare with TLS fingerprinting; we can't save it with archivebot :| |
22:33:17 | <Webuser566891> | :( |
22:33:32 | <Webuser566891> | maybe they could be told to disable that option? dunno |
22:34:17 | <h2ibot> | Pokechu22 edited Deathwatch (+230, /* 2025 */ https://www.ejde.net/): https://wiki.archiveteam.org/?diff=56630&oldid=56602 |
22:35:59 | | DogsRNice joins |
22:41:37 | | Webuser566891 quits [Client Quit] |
23:36:45 | | etnguyen03 quits [Client Quit] |