01:27:49 | | dabs quits [Client Quit] |
01:49:16 | | etnguyen03 (etnguyen03) joins |
01:56:00 | | emanuele6 quits [Read error: Connection reset by peer] |
02:23:09 | | etnguyen03 quits [Client Quit] |
02:24:20 | | etnguyen03 (etnguyen03) joins |
02:45:49 | | pabs quits [Ping timeout: 260 seconds] |
02:52:07 | | Wohlstand quits [Quit: Wohlstand] |
02:54:00 | | ducky quits [Remote host closed the connection] |
02:58:36 | | pabs (pabs) joins |
03:33:32 | | fangfufu joins |
03:33:36 | | fangfufu is now authenticated as fangfufu |
03:35:26 | | etnguyen03 quits [Remote host closed the connection] |
04:00:11 | | Guest58 joins |
04:04:35 | <pabs> | https://splits.io/ shut down, but the data is still accessible. site is very scripty though |
04:17:03 | | fangfufu quits [Client Quit] |
04:19:22 | <pabs> | bing.com has 300k site:splits.io results, but it seems pretty hard to scrape them |
04:21:05 | <pabs> | some of the files look quite enumerable, DPoS could probably brute-force them |
04:21:06 | <pabs> | https://s3.amazonaws.com/splits.io-runid-to-s3filename/by_id10/1127.json |
04:21:13 | | fangfufu joins |
04:21:13 | <pabs> | https://s3.amazonaws.com/splits.io-runid-to-s3filename/by_username/glacials.json |
04:21:17 | | fangfufu is now authenticated as fangfufu |
04:21:17 | <pabs> | https://s3.amazonaws.com/splits.io-runid-to-s3filename/categories/by_id/11110.json |
04:21:22 | <pabs> | https://s3.amazonaws.com/splits.io-runid-to-s3filename/games/by_id/175.json |
04:21:29 | <pabs> | https://s3.amazonaws.com/splits.io/splits/10x |
04:22:55 | <pabs> | others not so much: https://s3.amazonaws.com/splits.io/splits/0c2f2410-cee1-4100-8070-2acedc5890d6 |
04:23:41 | <pabs> | does anyone know of a good way to scrape Bing? their pagination seems to be all sorts of wonky in a browser :/ |
04:25:29 | <pabs> | hmm, might just email them |
04:26:21 | | benjins3_ quits [Read error: Connection reset by peer] |
04:27:44 | | benjins3_ joins |
04:37:14 | | Guest joins |
04:43:56 | <pabs> | !tell trollface2006 contacted splits.io admin about saving the remaining data |
04:43:57 | <eggdrop> | [tell] ok, I'll tell trollface2006 when they join next |
04:56:37 | | Guest58 quits [Client Quit] |
05:03:44 | | ducky (ducky) joins |
05:21:00 | <h2ibot> | PaulWise edited Internet Archive/Save Page Now (+52, add info about ad blocker /cc Jake): https://wiki.archiveteam.org/?diff=56862&oldid=56861 |
05:44:14 | | dhinakg quits [Quit: dhinakg] |
05:47:13 | | DogsRNice quits [Read error: Connection reset by peer] |
06:17:25 | | BornOn420 (BornOn420) joins |
06:27:16 | | BornOn420 quits [Client Quit] |
06:30:36 | | Guest58 joins |
06:46:03 | | Webuser613747 joins |
06:46:48 | | awauwa (awauwa) joins |
06:50:00 | | Island quits [Read error: Connection reset by peer] |
06:52:43 | | Webuser613747 quits [Client Quit] |
06:56:06 | | BornOn420 (BornOn420) joins |
06:57:34 | | nicolas17 quits [Quit: Konversation terminated!] |
07:00:44 | | BornOn420 quits [Ping timeout: 260 seconds] |
07:15:22 | | BornOn420 (BornOn420) joins |
07:44:35 | | emanuele6 (emanuele6) joins |
08:04:04 | | Webuser018602 joins |
08:13:37 | | Webuser018602 quits [Client Quit] |
08:43:00 | | Guest58 quits [Client Quit] |
08:43:28 | | Guest58 joins |
08:52:44 | | emanuele6 quits [Read error: Connection reset by peer] |
08:59:09 | | HP_Archivist quits [Ping timeout: 260 seconds] |
09:01:25 | | Guest quits [Quit: Ooops, wrong browser tab.] |
09:18:07 | | hamouda joins |
09:18:50 | | emanuele6 (emanuele6) joins |
09:24:08 | <hamouda> | pokechu22 hii, I have tested the downloaded WARCS and they working great with no issues. the converting process also is fine. didn't have to rename files. I wanna tell you there are just two forums need to be archived, they no less valuable than others on the same domain. Could you help me archiving them? I need the first one to be one WARC file |
09:24:08 | <hamouda> | and the other forum to be archived with another task. first forum url: |
09:24:08 | <hamouda> | https://al-maktaba.org/book/31871 |
09:24:08 | <hamouda> | the second one: |
09:24:08 | <hamouda> | https://al-maktaba.org/book/31882 |
09:24:09 | <hamouda> | https://al-maktaba.org/book/31874 |
09:24:09 | <hamouda> | https://al-maktaba.org/book/31862 |
09:25:48 | <hamouda> | thank you for your patience. |
09:42:14 | | emanuele6 quits [Read error: Connection reset by peer] |
09:54:45 | | Guest58 quits [Client Quit] |
09:58:36 | | IDK (IDK) joins |
09:58:49 | <hamouda> | the second forum was this one on wayback M, https://web.archive.org/web/20100305171817/http://www.alfaseeh.com:80/vb |
10:02:47 | | NatTheCat6 (NatTheCat) joins |
10:04:34 | | NatTheCat quits [Ping timeout: 240 seconds] |
10:04:34 | | NatTheCat6 is now known as NatTheCat |
10:06:25 | | BornOn420 quits [Read error: Connection reset by peer] |
10:06:55 | | BornOn420 (BornOn420) joins |
10:12:08 | <hamouda> | the second one on wway back M was https://web.archive.org/web/20130227111516/http://tafsir.net/vb/ |
10:27:47 | | Guest58 joins |
10:32:17 | | beastbg8_ joins |
10:35:59 | | beastbg8__ quits [Ping timeout: 260 seconds] |
10:58:49 | | threedeeitguy69 quits [Quit: The Lounge - https://thelounge.chat] |
11:00:03 | | Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat] |
11:01:54 | <h2ibot> | Manu edited Discourse (+55, Active Discourses: Add Docker Community Forums): https://wiki.archiveteam.org/?diff=56863&oldid=56802 |
11:02:51 | | Bleo182600722719623455222 joins |
11:17:24 | | Matthww quits [Ping timeout: 260 seconds] |
11:18:09 | | emanuele6 (emanuele6) joins |
11:29:58 | <h2ibot> | Bzc6p edited Talk:Main Page (+582, /* Removing Tracker links and Adding data…): https://wiki.archiveteam.org/?diff=56864&oldid=56843 |
11:31:16 | | Matthww joins |
11:44:25 | | Dada joins |
11:53:27 | | hamouda quits [Quit: Ooops, wrong browser tab.] |
11:58:40 | | hamouda joins |
11:59:20 | | hamouda quits [Client Quit] |
12:02:07 | | threedeeitguy69 (threedeeitguy) joins |
12:21:40 | | Guest58 quits [Client Quit] |
12:22:01 | | Guest58 joins |
12:27:06 | <h2ibot> | Manu edited ArchiveBot/Monitoring (+143, Add publish.obsidian.md monitoring): https://wiki.archiveteam.org/?diff=56865&oldid=56814 |
12:46:42 | | etnguyen03 (etnguyen03) joins |
12:48:07 | | yano quits [Quit: WeeChat, https://weechat.org/] |
12:50:20 | | yano (yano) joins |
12:59:32 | | Webuser572592 joins |
13:00:18 | | Webuser572592 quits [Client Quit] |
13:10:07 | | Guest58 quits [Client Quit] |
13:10:20 | | Wohlstand (Wohlstand) joins |
13:17:12 | | Guest58 joins |
13:29:47 | | etnguyen03 quits [Client Quit] |
13:41:43 | | hamouda joins |
13:42:04 | | emanuele6 quits [Ping timeout: 260 seconds] |
13:42:39 | | hamouda quits [Client Quit] |
13:56:38 | | Dada quits [Remote host closed the connection] |
13:59:47 | | etnguyen03 (etnguyen03) joins |
14:11:27 | <h2ibot> | Cruller edited Deathwatch (+270, Add LaCoocan free plan): https://wiki.archiveteam.org/?diff=56866&oldid=56677 |
15:01:15 | | dhinakg (dhinakg) joins |
15:10:20 | | hexagonwin quits [Read error: Connection reset by peer] |
15:12:07 | | hexagonwin joins |
15:12:58 | | Guest58 quits [Client Quit] |
15:13:34 | | TheEnbyperor quits [Ping timeout: 240 seconds] |
15:13:37 | | TheEnbyperor_ is now known as TheEnbyperor |
15:13:43 | | TheEnbyperor_ joins |
15:20:14 | | TheEnbyperor_ quits [Ping timeout: 240 seconds] |
15:20:22 | | TheEnbyperor_ joins |
15:31:54 | | etnguyen03 quits [Client Quit] |
15:35:27 | | TunaLobster quits [Quit: So long and thanks for all the fish] |
15:37:21 | | grill (grill) joins |
15:38:58 | | Webuser745792 joins |
15:39:19 | | TunaLobster joins |
15:39:57 | <Webuser745792> | !a https://vimeo.com/1083170371/cb1c4a74d4?share=copy -e "removed video reupload" |
15:43:51 | | Guest joins |
15:51:08 | | hamouda joins |
15:52:03 | | hamouda quits [Client Quit] |
16:04:44 | | FiTheArchiver joins |
16:05:23 | | FiTheArchiver quits [Client Quit] |
16:12:24 | <Webuser745792> | Thanks. |
16:12:30 | <Webuser745792> | Goodbye. |
16:12:42 | | Webuser745792 leaves |
16:17:45 | <h2ibot> | Monika edited List of websites excluded from the Wayback Machine (+31, Add www.codeztslabel.com): https://wiki.archiveteam.org/?diff=56867&oldid=56826 |
16:29:40 | | DogsRNice joins |
16:30:15 | | grill quits [Client Quit] |
16:49:19 | | dabs joins |
17:02:29 | | etnguyen03 (etnguyen03) joins |
17:17:54 | <h2ibot> | Fusl edited Anubis/uncategorized (+30): https://wiki.archiveteam.org/?diff=56868&oldid=56531 |
17:18:54 | <h2ibot> | Fusl edited Anubis/uncategorized (+1): https://wiki.archiveteam.org/?diff=56869&oldid=56868 |
17:20:03 | | etnguyen03 quits [Client Quit] |
17:31:59 | | Dada joins |
17:38:59 | | HP_Archivist (HP_Archivist) joins |
17:44:30 | | ducky quits [Remote host closed the connection] |
17:45:46 | | hamouda joins |
17:45:46 | | ducky (ducky) joins |
17:47:41 | <hamouda> | pokechu22 hiii |
17:47:57 | <pokechu22> | Hi, I saw your messages and will start jobs shortly |
17:49:37 | <hamouda> | thank you so much, I told you because you know already how the domain works. |
18:00:27 | <pokechu22> | hamouda: jobs started, WARCs will be at https://archive.fart.website/archivebot/viewer/job/41rtt for tafsir.net and https://archive.fart.website/archivebot/viewer/job/d3g5v for alfaseeh.com |
18:04:35 | <hamouda> | thank you! what is the url for the tasks themselves? |
18:06:19 | <pokechu22> | http://archivebot.com/?initialFilter=al-maktaba.org |
18:11:44 | <hamouda> | please note that there was a typo when I wrote the second forum on way back M for the second line (sentence) . I meant the first forum. the second forum was this one on wayback M, https://web.archive.org/web/20100305171817/http://www.alfaseeh.com:80/vb |
18:11:44 | <hamouda> | 13:12 <hamouda> the first one on way back M was https://web.archive.org/web/20130227111516/http://tafsir.net/vb/ I've edited this with the first one. |
18:12:11 | <pokechu22> | Yes, I figured that out :) |
18:12:42 | | TunaLobster quits [Client Quit] |
18:13:01 | <pokechu22> | I think the two jobs are named correctly (I checked for matching forum threads) but I can't read the arabic script so I'm not 100% sure |
18:13:28 | | TunaLobster joins |
18:13:32 | <hamouda> | great! I wish you all the best. |
18:15:51 | | awauwa quits [Quit: awauwa] |
18:18:20 | | ducky quits [Remote host closed the connection] |
18:18:37 | | ducky (ducky) joins |
18:19:59 | <hamouda> | they're named correctly, I have checked the urls and job names. |
18:20:36 | | ducky_ (ducky) joins |
18:23:14 | | ducky quits [Ping timeout: 260 seconds] |
18:23:14 | | ducky_ is now known as ducky |
18:55:20 | <ericgallager> | https://thisweekinvideogames.com/feature/video-game-history-foundation-on-the-significance-of-computer-entertainer/ |
19:05:45 | | ducky quits [Read error: Connection reset by peer] |
19:07:26 | | ducky (ducky) joins |
19:11:07 | | hackbug quits [Remote host closed the connection] |
19:13:49 | | hackbug (hackbug) joins |
19:24:20 | | wickedplayer494 is now authenticated as wickedplayer494 |
20:02:55 | | IDK quits [Quit: Connection closed for inactivity] |
20:06:29 | | nicolas17 joins |
20:07:04 | | hamouda quits [Client Quit] |
20:16:18 | | Island joins |
20:31:24 | | Wohlstand quits [Quit: Wohlstand] |
21:21:17 | | linuxgemini (linuxgemini) joins |
21:27:47 | | a-dude joins |
21:27:58 | | a-dude quits [Remote host closed the connection] |
21:59:31 | | Dada quits [Remote host closed the connection] |
22:03:22 | | etnguyen03 (etnguyen03) joins |
22:30:39 | | Webuser657159 joins |
22:31:57 | | Webuser657159 quits [Client Quit] |
22:45:28 | | etnguyen03 quits [Client Quit] |
22:48:49 | | etnguyen03 (etnguyen03) joins |
22:55:39 | | Guest quits [Quit: Ooops, wrong browser tab.] |
22:59:41 | | DopefishJustin quits [Remote host closed the connection] |
23:06:35 | | DopefishJustin joins |
23:06:35 | | DopefishJustin is now authenticated as DopefishJustin |
23:46:45 | | etnguyen03 quits [Client Quit] |
23:47:07 | | etnguyen03 (etnguyen03) joins |
23:54:53 | | Guest58 joins |
23:56:52 | | etnguyen03 quits [Client Quit] |
23:57:14 | | etnguyen03 (etnguyen03) joins |