Just some Internet guy

He/him/them 🏳️‍🌈

  • 0 Posts
  • 389 Comments
Joined 1 year ago
cake
Cake day: June 25th, 2023

help-circle







  • I had to block ByteSpider at work because it can’t even parse HTML correctly and just hammers the same page and accounts to sometimes 80% of the traffic hitting a customer’s site and taking it down.

    The big problem with AI scrapers is unlike Google and traditional search engines, they just scrape so aggressively. Even if it’s all GETs, they hit years old content that’s not cached and use up the majority of the CPU time on the web servers.

    Scraping is okay, using up a whole 8 vCPU instance for days to feed AI models is not. They even actively use dozens of IPs to bypass the rate limits too, so theyre basically DDoS’ing whoever they scrape with no fucks given. I’ve been woken up by the pager way too often due to ByteSpider.

    My next step is rewriting all the content with GPT-2 and serving it to bots so their models collapse.


  • That’s pretty much why I made my own instance: nobody can take it away from me. I can ban whichever instance I deem hostile or don’t want content from. Nobody’s taking away my API anymore or shoving ads in my face.

    Nobody can pull a Reddit or Twitter on the fediverse, there will always be alternative instances to use putting pressure on the big ones to not drive away people.





  • I don’t want it to wake up in the middle of the night for no reason.

    What Windows have been doing the last couple years is they moved from regular sleep to some poorly implemented standby mode that works more like a phone does where it still runs just very power efficiently and still does stuff in the background. Macs have been doing that for a long time except they actually did it right so it doesn’t suck.

    Linux doesn’t support it yet so you’ll get classic stop the world sleep anyway, but either way it’ll always be customizable even when connected sleep gets implemented.







  • If downgrading the kernel fixes it then it sounds a lot like a kernel bug. Still worth reporting to libinput I guess, they’ll probably be of better help to report it to the kernel properly with details of what broke, if it ends up that way.

    If you really want to get involved you can also bisect the exact commit that caused it in the kernel, but that’s a lot of kernel compiling and rebooting ahead of you.


  • That shouldn’t be a problem. You can even install them on the same btrfs partition if you wanted to as long as each distro gets its own set of subvolumes for stuff. Separate partitions and even separate physical disks? No issues there, that’s even less weird.

    Ideally what I’d do in this scenario is at least make a subvolume for the Steam library that way you don’t have to mount the actual home folder, just the Steam library subvolume. I also have a separate subvolume for movies and TV shows, and a few other things. It’s just very convenient for organization purposes, and also technical purposes because now my home snapshots don’t take all that much space as all the big data stuff is separate. No point backing up a Steam library.

    But in the end, none of this really matters, you can mount anything just about anywhere. We all already mount a FAT32 at /boot or somewhere similar because UEFI requires it. The filesystems all have UUIDs which are usually used for configuring fstab and GRUB and whatnot, precisely so even if you physically swap the disks or even put it into another computer, it still works.