00:21:49 | | etnguyen03 quits [Client Quit] |
00:31:02 | <h2ibot> | Usernam edited List of websites excluded from the Wayback Machine/Partial exclusions (+70): https://wiki.archiveteam.org/?diff=57525&oldid=57486 |
00:34:17 | <nicolas17> | https://data.nicolas17.xyz/samsung-grab/ 12 pending |
00:42:39 | | anonymoususer852 quits [Ping timeout: 260 seconds] |
01:02:48 | | etnguyen03 (etnguyen03) joins |
01:49:10 | | SootBector quits [Remote host closed the connection] |
01:50:23 | | SootBector (SootBector) joins |
02:01:03 | | SootBector quits [Remote host closed the connection] |
02:02:08 | | SootBector (SootBector) joins |
02:29:19 | | sg72 joins |
02:29:27 | | Guest58 quits [Quit: My Mac has gone to sleep. ZZZzzz…] |
02:35:09 | | Guest58 joins |
02:36:37 | | etnguyen03 quits [Client Quit] |
02:39:06 | | etnguyen03 (etnguyen03) joins |
02:48:59 | | Guest58 quits [Client Quit] |
02:50:31 | | Guest58 joins |
02:52:49 | | HP_Archivist quits [Read error: Connection reset by peer] |
02:57:06 | | etnguyen03 quits [Remote host closed the connection] |
03:08:53 | | Island quits [Read error: Connection reset by peer] |
03:54:04 | | Webuser929306 joins |
03:55:17 | | Webuser929306 quits [Client Quit] |
04:04:29 | | datechnoman quits [Ping timeout: 260 seconds] |
04:12:13 | | datechnoman (datechnoman) joins |
05:04:01 | | DogsRNice quits [Read error: Connection reset by peer] |
05:13:32 | | gosc joins |
05:53:05 | | Wohlstand quits [Quit: Wohlstand] |
06:06:24 | | Stagnant_ quits [Ping timeout: 260 seconds] |
06:16:20 | | Stagnant_ (Stagnant) joins |
06:28:31 | | Stagnant_ quits [Ping timeout: 258 seconds] |
06:38:06 | | Stagnant_ (Stagnant) joins |
06:59:34 | | gosc quits [Ping timeout: 258 seconds] |
07:13:49 | | gosc joins |
07:40:09 | | apache2 quits [Remote host closed the connection] |
07:41:37 | | apache2 joins |
07:52:14 | <h2ibot> | Hans5958 edited Tistory (-1): https://wiki.archiveteam.org/?diff=57526&oldid=57516 |
07:58:14 | <h2ibot> | Hans5958 edited Template:Project status templates (+1): https://wiki.archiveteam.org/?diff=57527&oldid=24048 |
07:58:15 | <h2ibot> | Hans5958 created Template:Shutting down (+30, Redirected page to [[Template:Closing]]): https://wiki.archiveteam.org/?title=Template%3AShutting%20down |
08:00:15 | <h2ibot> | Hans5958 edited Typepad (-3): https://wiki.archiveteam.org/?diff=57529&oldid=57439 |
08:00:16 | <h2ibot> | Hans5958 edited Typepad (+2): https://wiki.archiveteam.org/?diff=57530&oldid=57529 |
08:01:15 | <h2ibot> | Hans5958 edited SourceForge (+1052): https://wiki.archiveteam.org/?diff=57531&oldid=49502 |
08:01:16 | <h2ibot> | Hans5958 edited SourceForge (+0, /* Developer Web removal (2025) */): https://wiki.archiveteam.org/?diff=57532&oldid=57531 |
08:14:45 | | HackMii quits [Ping timeout: 255 seconds] |
08:16:36 | | HackMii (hacktheplanet) joins |
08:25:04 | | woans (woans) joins |
08:28:58 | | Radzig2 joins |
08:29:22 | | cscr-radio joins |
08:29:33 | <cscr-radio> | Hi, i'm trying to contact someone here about scraping my site |
08:29:52 | | cscr-radio quits [Client Quit] |
08:30:51 | | cscr-radio joins |
08:31:39 | | Radzig quits [Ping timeout: 260 seconds] |
08:31:39 | | Radzig2 is now known as Radzig |
08:36:19 | | Radzig2 joins |
08:36:56 | | Radzig quits [Ping timeout: 258 seconds] |
08:36:56 | | Radzig2 is now known as Radzig |
09:37:56 | | VerifiedJ quits [Quit: The Lounge - https://thelounge.chat] |
09:38:32 | | VerifiedJ (VerifiedJ) joins |
09:44:35 | <pabs> | cscr-radio: which site? |
10:12:23 | | woans quits [Ping timeout: 258 seconds] |
10:12:42 | | notarobot17 quits [Quit: Ping timeout (120 seconds)] |
10:12:57 | | notarobot17 joins |
10:50:00 | | JTL1 (JTL) joins |
10:50:29 | | JTL quits [Ping timeout: 260 seconds] |
10:54:02 | | FiTheArchiver joins |
11:00:02 | | Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat] |
11:00:37 | | FiTheArchiver quits [Client Quit] |
11:02:44 | | Bleo182600722719623455222 joins |
11:51:47 | <h2ibot> | Hans5958 edited Template:CTA URL lists (+512): https://wiki.archiveteam.org/?diff=57533&oldid=51470 |
11:53:47 | <h2ibot> | Hans5958 edited SourceForge (+154, /* Developer Web removal (2025) */): https://wiki.archiveteam.org/?diff=57534&oldid=57532 |
11:53:48 | <h2ibot> | Hans5958 edited Template:CTA URL lists (+2): https://wiki.archiveteam.org/?diff=57535&oldid=57533 |
11:53:49 | <h2ibot> | Hans5958 edited Template:CTA URL lists (-1): https://wiki.archiveteam.org/?diff=57536&oldid=57535 |
11:58:17 | <cscr-radio> | pabs: i'll reach out through another medium as my colleague is already comms. Thanks |
12:08:49 | <h2ibot> | Hans5958 created Deathwatch/Dead as a Doornail (+198722, Move as it is more of an archive to reduce load): https://wiki.archiveteam.org/?title=Deathwatch/Dead%20as%20a%20Doornail |
12:08:50 | <h2ibot> | Hans5958 edited Deathwatch (-198690, /* Dead as a Doornail */ Move to separate page…): https://wiki.archiveteam.org/?diff=57538&oldid=57524 |
12:17:38 | <gosc> | does anyone want to look into the adobe aero stuff? much too big for just me; found that the app uses this to find Aero experiences, so all the stuff made by these users can be saved https://www.behance.net/search?tools=883705687 |
12:35:52 | | Guest quits [Read error: Connection reset by peer] |
12:36:09 | | Guest joins |
12:37:22 | <cruller> | Is it true that “the app uses this to find Aero experiences”? For example, "yusukekashiwa" doesn't appear there. |
12:40:47 | <cruller> | Conversely, "madeup" appears there and https://cc-api-cp.adobe.io/api/v2/aero/users/madeup/assets?api_key=Aero_Content_Service1 give no results. |
12:47:48 | <cruller> | Even if that's the case, I think that's the best way to find users. |
12:49:54 | | twiswist_ (twiswist) joins |
12:52:59 | | twiswist quits [Ping timeout: 260 seconds] |
12:56:16 | <cruller> | s/Even if that's the case/Even if there are false positives and false negatives/ |
13:07:43 | | Wohlstand (Wohlstand) joins |
13:23:25 | | nine quits [Quit: See ya!] |
13:23:37 | | nine joins |
13:23:37 | | nine is now authenticated as nine |
13:23:38 | | nine quits [Changing host] |
13:23:38 | | nine (nine) joins |
13:27:34 | <gosc> | cruller, not all users, but a lot of them do show up |
13:28:14 | <gosc> | yusuke's work wasn't even on behance, so a lot of works aren't public; though the ones that are should be on that link to my understanding |
13:45:38 | | T31M quits [Quit: ZNC - https://znc.in] |
13:46:28 | | T31M joins |
13:46:30 | | T31M is now authenticated as T31M |
14:49:57 | <justauser|m> | I'm surprised Sourcegraph community isn't running in AB yet. Does someone handle it by hand? |
14:59:07 | | woans (woans) joins |
15:01:22 | | Island joins |
15:05:59 | | woans quits [Ping timeout: 260 seconds] |
15:12:59 | | dave quits [Ping timeout: 260 seconds] |
15:16:31 | <cruller> | gosc: “the ones that are” refers only to https://www.behance.net/gallery/124733809/UNIQLO-TOKYO-AR-EFFECT and https://www.behance.net/gallery/124713797/POWER-OF-FASHION-AR-ART-EFFECT, right? Those don't appear in the search results. https://www.behance.net/search/projects/POWER%20OF%20FASHION%20AR%20ART%20EFFECT?tools=883705687 |
15:19:34 | | sum1 joins |
15:19:50 | | sum1 quits [Client Quit] |
15:20:54 | <cruller> | Those individual gallery pages also don't contain links to https://www.behance.net/search?tools=883705687. |
15:25:22 | <cruller> | Logging in and endlessly scrolling through https://www.behance.net/search/projects/?tools=883705687 might reveal them, but I haven't tried it. |
15:33:47 | | Dada joins |
15:33:59 | | ducky quits [Ping timeout: 260 seconds] |
15:46:39 | <mgrandi> | https://natcast.org/ is winding down operations according to news sites, technically a government non profit |
15:46:57 | | Webuser842458 joins |
15:47:11 | | Webuser842458 quits [Client Quit] |
15:50:29 | | ducky (ducky) joins |
15:50:41 | <gosc> | cruller, sorry, I'm mistaken then |
15:51:05 | | cyanbox quits [Read error: Connection reset by peer] |
15:52:19 | | cscr-radio quits [Read error: Connection reset by peer] |
15:53:57 | <cruller> | No problem. I haven't found a better approach than it either. |
15:55:07 | <justauser|m> | JAA: Sourcegraph ^^^. |
16:07:47 | | woans (woans) joins |
16:08:09 | <@JAA> | AB job for https://community.sourcegraph.com/ is running now. |
16:22:14 | | kiska52 quits [Quit: Ping timeout (120 seconds)] |
16:22:19 | | @dxrt quits [Remote host closed the connection] |
16:22:32 | | kiska52 joins |
16:22:43 | | dxrt joins |
16:22:46 | | dxrt is now authenticated as dxrt |
16:22:46 | | dxrt quits [Changing host] |
16:22:46 | | dxrt (dxrt) joins |
16:22:46 | | @ChanServ sets mode: +o dxrt |
16:27:01 | | dave (dave) joins |
17:06:39 | <egallager> | is SPN giving anyone else errors around now currently? |
17:07:52 | <@JAA> | → #internetarchive |
17:09:34 | <egallager> | oh actually nvm, just a hiccup, it seems... |
17:11:40 | | Island_ joins |
17:13:09 | | Island quits [Ping timeout: 260 seconds] |
17:15:12 | | JTL1 is now known as JTL |
17:21:39 | <h2ibot> | Cooljeanius edited SourceForge (+100, Use URL template more): https://wiki.archiveteam.org/?diff=57539&oldid=57534 |
17:26:53 | | gosc quits [Quit: Leaving] |
17:26:57 | | Guest58 quits [Quit: My Mac has gone to sleep. ZZZzzz…] |
17:29:02 | | Guest58 joins |
17:32:40 | <h2ibot> | Cooljeanius edited SourceForge (+8, oops, missed one): https://wiki.archiveteam.org/?diff=57540&oldid=57539 |
17:36:50 | | egallager quits [Quit: This computer has gone to sleep] |
17:50:06 | | egallager joins |
18:10:21 | | Webuser510251 joins |
18:10:34 | | Webuser510251 quits [Client Quit] |
18:21:01 | | Island joins |
18:24:12 | | Island_ quits [Ping timeout: 258 seconds] |
18:24:28 | | HackMii quits [Remote host closed the connection] |
18:24:45 | | HackMii (hacktheplanet) joins |
18:30:07 | | archiveDrill quits [Quit: The Lounge - https://thelounge.chat] |
18:31:44 | | archiveDrill joins |
18:46:26 | | woans quits [Ping timeout: 258 seconds] |
18:47:11 | | HackMii quits [Remote host closed the connection] |
18:47:27 | | HackMii (hacktheplanet) joins |
18:50:11 | | midou quits [Remote host closed the connection] |
18:50:13 | | midou joins |
18:56:39 | | woans (woans) joins |
19:00:18 | | woans quits [Client Quit] |
19:10:58 | | abirkill quits [Quit: Let us prepare to grapple with the ineffable itself, and see if we may not eff it after all.] |
19:17:37 | | abirkill (abirkill) joins |
19:26:18 | | abirkill quits [Ping timeout: 258 seconds] |
19:29:22 | | abirkill (abirkill) joins |
19:35:02 | | woans (woans) joins |
19:49:36 | | cyanbox joins |
20:20:59 | | HP_Archivist (HP_Archivist) joins |
20:21:42 | | hexagonwin quits [Read error: Connection reset by peer] |
20:23:14 | | hexagonwin joins |
20:25:39 | | hexagonwin quits [Read error: Connection reset by peer] |
20:27:09 | | hexagonwin joins |
20:33:53 | | cipherrot (petrichor) joins |
20:34:59 | | petrichor quits [Ping timeout: 260 seconds] |
20:35:23 | | Dada quits [Remote host closed the connection] |
20:39:17 | | hexagonwin quits [Read error: Connection reset by peer] |
20:41:19 | | hexagonwin joins |
20:50:44 | | Dango360 quits [Ping timeout: 260 seconds] |
20:51:12 | | Dango360 (Dango360) joins |
21:04:03 | | abirkill quits [Ping timeout: 258 seconds] |
21:07:48 | <Adamvoltagex|m> | https://mastodon.social/@FINOkoye/115294739327499612 |
21:17:30 | <nicolas17> | is there any JS monstrosity in those pages that needs SPN? |
21:28:03 | | etnguyen03 (etnguyen03) joins |
21:29:26 | | erkinalp joins |
21:30:52 | <erkinalp> | guilded is shutting down, any plans to archive public communities? |
21:31:00 | <masterx244|m> | regular users don't know the AT infra and assume that SPN is the only way to add to the WBM outside of the IA blackbox crawls |
21:31:20 | <erkinalp> | (requires special purpose crawlers due to login wall) |
21:31:50 | <nicolas17> | masterx244|m: yes, I'm wondering if I can tell them "stop using SPN, we'll run the whole thing through archivebot" or if it turns out archivebot wouldn't actually work |
21:32:12 | <Guest> | erkinalp that was just posted yesterday but afaik there isnt any plan yet (on deathwatch already though) |
21:32:45 | <masterx244|m> | no way to WBM due to JS hell. needs to be treaded like discord on the archiving workflow |
21:32:58 | <erkinalp> | there's takeout bots for both |
21:33:18 | <erkinalp> | i know the takeout structure for discord but guilded doesn't have such a data takeout feature |
21:33:28 | <nicolas17> | I'm talking about bcaexhibits not guilded |
21:36:52 | <Guest> | guilded has an api for data https://www.guilded.gg/docs/api/introduction |
21:36:59 | <erkinalp> | masterx244|m yep i know, the good thing is both discord and guilded have resumable websocket protocols which means you can rewind the ws gateway to zero then watch the dump building up |
21:37:06 | <erkinalp> | Guest hmm |
21:37:25 | <erkinalp> | then let's try that but i'm sure it only includes things sent by us |
21:37:41 | <Guest> | just realized thats actually the bot documentation |
21:38:11 | <Guest> | discord and guilded dont have publicly available user apis but you can build your own scraper if you want by monitoring network tab |
21:39:08 | <erkinalp> | you don't need to scrape actually, their websocket is resumable, just rewind to zero then wait it to fill up with all msgs and data |
21:39:28 | <erkinalp> | same trick also applies to discord |
21:40:06 | <erkinalp> | both discord and guilded have well made third party docs for human user apis |
21:45:26 | <mikolaj|m> | Polish-language forum about fantasy, largely dead, I checked a few old threads and couldn't find them preserved in Wayback Machine: http://www.wiezablaznow.pl/index.php |
21:45:40 | <mikolaj|m> | would be nice if someone threw it into ArchiveBot |
21:48:04 | <pokechu22> | mikolaj|m: queued |
21:48:16 | | ats quits [Read error: Connection reset by peer] |
21:48:22 | <mikolaj|m> | thanks |
21:52:38 | | etnguyen03 quits [Client Quit] |
21:53:17 | | Naruyoko5 joins |
21:55:25 | | Naruyoko quits [Ping timeout: 258 seconds] |
21:58:04 | | Naruyoko joins |
21:58:06 | | Naruyoko5 quits [Ping timeout: 258 seconds] |
21:58:34 | <Guest> | erkinalp: what do you mean by "rewind to 0"? |
22:00:58 | | ats (ats) joins |
22:02:21 | | beastbg8__ joins |
22:05:24 | | beastbg8_ quits [Ping timeout: 260 seconds] |
22:09:53 | | Dada joins |
22:11:49 | | hexagonwin quits [Read error: Connection reset by peer] |
22:12:39 | | hexagonwin joins |
22:32:08 | | etnguyen03 (etnguyen03) joins |
22:33:25 | | Dada quits [Remote host closed the connection] |
22:36:03 | | cyanbox quits [Read error: Connection reset by peer] |
22:58:11 | | hamouda joins |
23:21:30 | <cruller> | Omoroid's sitemap containing 5,290 items (technically MRSS format): https://giga.web.docomo.ne.jp/feed.xml |
23:28:57 | | etnguyen03 quits [Client Quit] |
23:31:07 | | cmlow0 joins |
23:33:10 | | cmlow quits [Ping timeout: 258 seconds] |
23:33:10 | | cmlow0 is now known as cmlow |
23:55:01 | | etnguyen03 (etnguyen03) joins |