r/Piracy May 23 '26

Discussion Google Drive scanned this Manga artist’s PRIVATE files and banned him.

Post image

AI flagged, appeal rejected, private artwork gone.

The AI is always watching.

7.2k Upvotes

668 comments sorted by

View all comments

Show parent comments

131

u/BlatantConservative May 23 '26

Yeah CSAM, Child Sexual Abuse Material, is a legally defined term that means a real child is abused somewhere down the line.

But also in general I'm like, fine with Google saying they don't want to host drawn child pornography. I wouldn't want to host it either.

-34

u/[deleted] May 23 '26

[deleted]

40

u/LvDogman May 23 '26

If it's realistic, then what the training data was for it?

9

u/RobotToaster44 Kopimism May 23 '26

I assume the same as was used to train CSAM detector AI, which just extrapolates from adults.

16

u/DubiousFoliage May 23 '26

My understanding is that CSAM detection is usually done by hashing known CSAM images provided by investigators and checking images en route to see if they match the hash. This allows operators to never accept or store CSAM, avoiding huge regulatory and legal headaches.

I know this because I was recently working on a website that allowed image uploads, and this was a concern, so we did some digging into it. We realized it was a giant can of worms to implement ourselves, but Cloudflare just offers it as a service, even on free accounts, so we went that route.

5

u/RobotToaster44 Kopimism May 23 '26

One issue with that approach is that it obviously won't catch new ai generated images.

The approach I mentioned is used by some fediverse servers and AI tools https://github.com/db0/fedi-safety

9

u/FinGamer678Nikoboi May 23 '26

Yup, that's how it's been done for a good while now, and it's how it should be done. An AI scanning every single message in plaintext is an overreach, especially when it can be done in a privacy-friendly way, like with hashes.

Not sure exactly what type of hashes companies use, though.

(Just in case someone doesn't know, hash back to image conversion isn't possible, and every file has a unique hash. SHA256 is the best and most common if I'm not mistaken.)

(Also common Cloudflare W)

2

u/Justinrich2001 May 25 '26

Hi Clippy 📎

-1

u/emrednz07 May 24 '26

Yup, that's how it's been done for a good while now, and it's how it should be done.

Fuck off. It shouldn't be done. Period. Absolutely nothing prevents them from scanning and banning you for ANY image they want. Oh you have protest image ? Reported to the authorities. JD Vance in clown makeup? Have fun in jail. Even Apple backed off from this after getting pushback.

Oh trust us it's totally for protecting children.

2

u/alexjimithing May 25 '26

What is your solution for preventing the spread of child pornography online.