00:07:34Arcorann (Arcorann) joins
00:10:29Arcorann quits [Remote host closed the connection]
00:14:21rocketdive quits [Remote host closed the connection]
00:16:32Arcorann (Arcorann) joins
00:32:53dm4v_ joins
00:34:03dm4v quits [Ping timeout: 265 seconds]
00:34:03dm4v_ is now known as dm4v
00:42:09driib quits [Quit: The Lounge - https://thelounge.chat]
00:42:36driib5 (driib) joins
01:02:17BlueMaxima joins
01:02:40dm4v_ joins
01:04:01dm4v quits [Ping timeout: 265 seconds]
01:04:01dm4v_ is now known as dm4v
01:09:24Gray_cat quits [Ping timeout: 246 seconds]
01:14:29Ryz20 (Ryz) joins
01:14:31s-crypt6 (s-crypt) joins
01:15:22Ryz2 quits [Client Quit]
01:15:22kiska quits [Client Quit]
01:15:22s-crypt quits [Client Quit]
01:15:22driib5 quits [Client Quit]
01:15:22dm4v quits [Client Quit]
01:15:22qwertyasdfuiopghjkl quits [Client Quit]
01:15:23Ryz20 is now known as Ryz2
01:15:23s-crypt6 is now known as s-crypt
01:15:27driib5 (driib) joins
01:15:27dm4v joins
01:15:28kiska (kiska) joins
01:24:44<@arkiver>let's also use #recordedjournal for livejournal channel
01:25:09<@arkiver>do we have a source on the latest livejournal news?
01:25:28qwertyasdfuiopghjkl joins
02:12:32Gray_cat joins
02:13:24wickedplayer494 quits [Remote host closed the connection]
02:23:07La_Gatta_Grigia joins
02:24:34Deewiant_ (Deewiant) joins
02:24:56driib5 quits [Client Quit]
02:24:56Gray_cat quits [Remote host closed the connection]
02:24:56sknebel quits [Remote host closed the connection]
02:24:56dm4v quits [Client Quit]
02:24:56Deewiant quits [Remote host closed the connection]
02:24:56qwertyasdfuiopghjkl quits [Client Quit]
02:24:59dm4v joins
02:25:02driib5 (driib) joins
02:25:48sknebel (sknebel) joins
02:29:29Deewiant_ is now known as Deewiant
02:32:10driib59 (driib) joins
02:32:23driib5 quits [Read error: Connection reset by peer]
02:32:23driib59 is now known as driib5
02:35:28wickedplayer494 joins
03:29:59nico_32 quits [Ping timeout: 265 seconds]
03:58:16nico_32 (nico) joins
04:20:48Discant joins
04:29:33La_Gatta_Grigia is now known as Gray_cat
04:33:52<Gray_cat>Similar question to rocketdive's - how come 10 GB was downloaded and only 4 unloaded for the telegram project? 6 GB of stylesheets got stripped away and only the message content was sent up?
04:37:04Discant quits [Remote host closed the connection]
04:37:25Discant joins
04:40:56qwertyasdfuiopghjkl joins
04:58:02Discant quits [Ping timeout: 245 seconds]
05:11:47<@OrIdow6>Compression
05:13:52Gray_cat quits [Ping timeout: 245 seconds]
05:14:27<@OrIdow6>Always a ping timeout
05:24:28<Jake>Heh
05:36:18Discant joins
06:21:43tzt (tzt) joins
08:42:53Discant quits [Remote host closed the connection]
08:42:53dm4v quits [Client Quit]
08:42:54qwertyasdfuiopghjkl quits [Client Quit]
08:42:58dm4v_ joins
08:43:01DiscantX joins
08:43:28dm4v_ is now known as dm4v
08:53:03BlueMaxima quits [Client Quit]
09:27:10DiscantX quits [Ping timeout: 265 seconds]
09:32:11qwertyasdfuiopghjkl joins
09:37:47DiscantX joins
11:08:23mohsaid joins
11:08:49<mohsaid>Hi
11:09:32<@OrIdow6>Hello mohsaid, what can we do for you?
11:10:01<mohsaid>I have a question.
11:10:40<@rewby>We may have answers if you ask your question. :)
11:15:22<@rewby>Announcement: I'm doing maintenance on my infrastructure momentarily. Both my IRC and several systems required for target management will be offline for a few hours. Targets should keep running but if anything stops functioning, I can't fix it until this maintenance is done.
11:16:58<mohsaid>My website (occlub.org) had been archived automatically on the wayback machine, and it says that the website had been archived originally by the (Archive Team). My question is, does that mean that the Archive Team had a copy of my website?
11:18:04<@OrIdow6>mohsaid: ArchiveTeam generally does not keep copies of the websites we archived as we do not have enough storage ourselves; rather we upload them to archive.org
11:18:14<@rewby>We had a copy of the site as it was visible to the public. Depending on how we did the archive, we may have not kept the copy after we uploaded it to the IA.
11:18:49<@OrIdow6>Also, we may have made and uploaded a copy of a single page, or of the whole site, or of a section
11:19:05<@rewby>Judging by the dataset being archiveteam_urls, it'll be single pages
11:19:20<@OrIdow6>That's what it looks like
11:19:28<@OrIdow6>And news.html isn't in the WBM at all
11:19:32<@rewby>Yeah
11:19:34<@OrIdow6>*.php
11:19:56<@rewby>So no, we don't have a copy anymore because #// moves so fast
11:21:02<mohsaid>Thank you for your assistance:)
11:21:09<thuban>why do you ask?
11:22:05<mohsaid>Because I'm curious how they got to my website.
11:22:47<thuban>the urls project (channel #//, collection archiveteam_urls) collects outlinks from a variety of sources
11:22:58<@rewby>^
11:23:06<@rewby>outlinks being, someone linked to it
11:23:09<@rewby>So maybe someone posted it on reddit
11:23:15<@rewby>Or we found it on social media somewhere
11:23:35<@rewby>Or it was found as part of another project
11:24:05<mohsaid>Yes, I posted it on my Facebook group.
11:24:17<@OrIdow6>We don't do Facebook AFAIK
11:24:35<@OrIdow6>Too hard for a variety of reasons
11:26:10rewby|backup (rewby) joins
11:26:10@ChanServ sets mode: +o rewby|backup
11:28:48<mohsaid>Also, I am interested in whether you use a crawler bot or something like this to crawl websites that have been posted on (channel #//)?
11:29:03<@rewby>We have a big distributed crawler, yeah
11:29:17<@rewby>If you're interested, we can tell one of our bots to crawl and archive your entire site.
11:30:42<mohsaid>No problem if you want to
11:31:29<mohsaid>And what is the name of the crawler?
11:31:48<@OrIdow6>The one for smaller-scale sites like this is ArchiveBot
11:31:59<@OrIdow6>Well, the one rew_by's talking about
11:32:07<@rewby>Yep
11:32:44<mohsaid>Is that crawl open source?
11:33:01<thuban>https://wiki.archiveteam.org/index.php?title=ArchiveBot
11:33:11<thuban>yep! https://github.com/ArchiveTeam/ArchiveBot
11:34:08<mohsaid>Thank you everyone for the help :)
11:34:19<thuban>you're welcome!
11:34:44<@rewby>Always happy to help!
11:34:53<mohsaid>That crawl will help me a lot.
11:35:14@rewby quits [Quit: WeeChat 3.5]
11:35:22mohsaid quits [Remote host closed the connection]
11:40:07DiscantX quits [Ping timeout: 245 seconds]
11:59:21@rewby|backup quits [Ping timeout: 246 seconds]
12:11:26rewby|backup (rewby) joins
12:11:26@ChanServ sets mode: +o rewby|backup
12:32:10rocketdive (rocketdive) joins
12:50:01<datechnoman>Could someone possibly share the docker creation command/script they are using to run the docker containers fully out of RAM using tmpfs? Looking to bypass I/O issues on disks if possible.
13:20:36<Sluggs>im using this on a few machines with local hdd
13:20:40<Sluggs>docker run -d --name telegram --label=com.centurylinklabs.watchtower.enable=true -v '/dev/shm/telegram':'/grab/data':'rw' --restart=unless-stopped atdr.meo.ws/archiveteam/telegram-grab --concurrent 4 Sluggs
13:31:17<@JAA>datechnoman: hackint/#down-the-tube 2021-11-30 18:27:48 UTC < ThreeHM> Hmm, good idea to use tmpfs for reducing disk load. Turns out docker has an option for that: "docker run --mount type=tmpfs,tmpfs-size=2G,destination=/grab/data ..."
13:32:34dm4v quits [Client Quit]
13:33:00dm4v joins
13:33:08rocketdive quits [Client Quit]
13:34:17Arcorann quits [Ping timeout: 245 seconds]
13:40:51@rewby|backup quits [Ping timeout: 246 seconds]
13:56:24mohsaid joins
13:56:43mohsaid quits [Remote host closed the connection]
14:11:45sec^nd quits [Remote host closed the connection]
14:12:15sec^nd (second) joins
14:15:29qwertyasdfuiopghjkl quits [Client Quit]
14:53:32dm4v quits [Client Quit]
14:53:36dm4v joins
15:01:06rtro joins
15:01:20rtro quits [Remote host closed the connection]
15:10:25HackMii quits [Remote host closed the connection]
15:13:32HackMii (hacktheplanet) joins
15:20:57HackMii quits [Ping timeout: 245 seconds]
15:22:27HackMii (hacktheplanet) joins
15:35:22Gray_cat joins
15:50:38rewby (rewby) joins
15:50:38@ChanServ sets mode: +o rewby
15:58:53rocketdive (rocketdive) joins
16:06:30LeGoupil joins
16:48:21<h2ibot>Jakiki6 edited Talk:Main Page (+169, /* Mirroring to IPFS */ new section): https://wiki.archiveteam.org/?diff=48686&oldid=48337
16:48:22<h2ibot>Entartet edited List of websites excluded from the Wayback Machine (+25, Added petitcolas.net.): https://wiki.archiveteam.org/?diff=48687&oldid=48685
16:49:21<h2ibot>7GLCS6S edited YouTube (+159): https://wiki.archiveteam.org/?diff=48688&oldid=48451
16:51:50rocketdive quits [Ping timeout: 265 seconds]
17:36:25sec^nd quits [Remote host closed the connection]
17:37:26sec^nd (second) joins
18:02:21marto_ quits [Quit: The Lounge - https://thelounge.chat]
18:35:14rocketdive (rocketdive) joins
18:56:54tzt quits [Ping timeout: 246 seconds]
19:04:42michaelblob_ quits [Ping timeout: 245 seconds]
20:23:32SketchTheCow quits [Ping timeout: 265 seconds]
20:58:37Gray_cat quits [Remote host closed the connection]
20:58:51Gray_cat joins
21:03:57LeGoupil quits [Client Quit]
21:04:32rocketdive quits [Remote host closed the connection]
22:01:06onetruth joins
22:05:56Arcorann (Arcorann) joins
22:19:03Arcorann quits [Ping timeout: 265 seconds]
22:20:40<datechnoman>Thanks for that info all!
22:21:01<datechnoman>just what I was after
22:27:17Arcorann (Arcorann) joins
22:40:28Arcorann quits [Ping timeout: 276 seconds]
22:40:57Gray_cat quits [Ping timeout: 245 seconds]
22:41:22michaelblob (michaelblob) joins
22:42:59Arcorann (Arcorann) joins
22:47:12Shjosan quits [Ping timeout: 245 seconds]
22:47:40Shjosan (Shjosan) joins
23:32:10qwertyasdfuiopghjkl joins
23:48:23BlueMaxima joins