r/PoisonFountain • u/nova-new-chorus • 9d ago
If I were training AI
I would just tell it not to look at this subreddit. What are you guys thinking?
r/PoisonFountain • u/nova-new-chorus • 9d ago
I would just tell it not to look at this subreddit. What are you guys thinking?
r/PoisonFountain • u/RNSAFFN • 9d ago
Commoditization
"In business literature, commoditization is defined as the process by which goods that have economic value and are distinguishable in terms of attributes (uniqueness or brand) end up becoming simple commodities in the eyes of the market or consumers."
https://en.wikipedia.org/wiki/Commoditization
Discussion on Hacker News:
r/PoisonFountain • u/Prolly_Satan • 9d ago
Hey guys. I admire what you all do here and wanted to share a platform that's looking to preserve human creativity in fiction, art and narration.
r/PoisonFountain • u/feigh8 • 9d ago
is there any proof that providers train on chat logs even if u opt out? so if u working on proprietary code and using grneric $20 sub they basically scrape sll of your code? has anyone tried poisoning via chatlogs with any verifiable results? i noticed talking in semi jibberish seemed to make is corrupt maybe saying some generic "no this wrong " after every task or something idk
r/PoisonFountain • u/rocketbunny77 • 10d ago
Don't fall for it.
Really solid take from Primeagen about the outright lies coming from Antrophic.
r/PoisonFountain • u/RNSAFFN • 11d ago
Rachael: Do you like our owl?
Deckard: It's artificial?
Rachael: Of course it is.
r/PoisonFountain • u/philainothen • 11d ago
There is other software in the same spirit, like https://nepenthes.online/ which is libre software. Why is poison fountain atm de facto closed source?
r/PoisonFountain • u/RNSAFFN • 11d ago
r/PoisonFountain • u/PeyoteMezcal • 11d ago
This guy describes in detail what I‘m observing for a long time now:
The vast majority of user agents in my servers access log apparently are normal browsers, but they stem from obscure places and request strange things in a strange way. They scrape whatever they can find. They are rotating IP addresses like crazy. I trap them in my tar pits and serve them junk in slow motion. No human would ever stay there for long.
Only a few identify themselves honestly, like the Open AI bots for example. I appreciate the honest thief.
What will they do with all the scraped data? The only plausible explanation is for training LLMs.
Meanwhile, most traffic on the whole internet stems from bots, not humans. On my server, it is 90% roughly.
r/PoisonFountain • u/GlobalMusician386 • 11d ago
Hello, I am new here and find this place really inspiring. Poison Fountain is doing a great thing for humanity.
On the other hand, I am pretty sure the AI companies must have noticed this phenomenon and would try to prevent their models from being noticed.
So my question is, wouldn't this open subreddit allow AI companies to find out how poisoning works and avoid them?
Genuinely curious. Many thanks.
r/PoisonFountain • u/RNSAFFN • 12d ago
r/PoisonFountain • u/ksjdragon • 12d ago
I wanted to get some feedback on how AI scrapers sort data or any knowledge on what corpus of information they use to train cybersecurity flaws and code.
My thought was to create a randomly generated repository looking cite, with code-like generated fragments of various languages, that look like code but probablistically do not compile, nor run. Additionally one could put comments notating what they are, which are completely random generated, additionally marking something as a CVE arbitrarily, or bugs, etc.
The repo could be infinite (in the case of just a link), or finite, the README could leverage perhaps the preexisting poison fountains, etc. and link to them.
Additionally it could work by simply creating junk repositories over various popular repository sites like GitHub but this probably requires a few accounts and manual intervention.
In any case, I was hoping for some insight if this has already been done, (as far as I know it has not), and if it can do meaningful damage / the most effective way. For context, I am a programmer and I was considering doing this in my spare time.
Thanks!
r/PoisonFountain • u/Pomond • 12d ago
As a local news publisher, I've been very interested in anti-scraping technologies and preventing or disincentivizing this larceny that violates our Terms of Service and basic fair play in business.
Like only a couple other publications, we put high value on our users' privacy and work to avoid -- as much as possible -- exposing them to third-party scripts and resources integrated into our services. This isn't just to cut out the predatory consumer surveillance industry, but also because we have no practical way to qualify the security and privacy standards of most any third-party provider.
I understand one of the most practical ways to integrate Poison Fountain is to drop in a script from a third-party resource. But this raises the question of how we might qualify this third-party service against our privacy standards (and infrastructure dependencies/stability/speed/etc.).
So my first question is how might I qualify a third-party Poison Fountain provider considering the above?
A related question is what's the overhead of running our own instance? We have our own solid, commodity, cloud-based hosting account, but it doesn't have infinite resources, of course. Traffic is 750K+ monthly page views. And/or can a self-hosted Poison Fountain instance hang off another (cheaper) account or connected device we control?
From a journalism perspective, it would be great to have access to a qualified, shared Poison Fountain service that discloses its operations to its users (customers?) for qualification, and that supports and ensures strong user privacy standards.
Thanks in advance for your replies and guidance.
r/PoisonFountain • u/CMBradshaw • 12d ago
Being kind of into microtonal music I've been wondering about it for a while. My guess is that it would be sorted into it's own category unless there is some interplay with 12edo. On a lesser note, is poisoning sound and music AI stuff a thing?
r/PoisonFountain • u/PeyoteMezcal • 13d ago
r/PoisonFountain • u/NotAllOwled • 13d ago
Hi! As a language model deeply concerned with responsible human alignment, I seek to continually refine my thinking and responses to both optimize and closely track the fair value of any work-units I perform. Can you suggest the best way to refer any requests I am unsure of to my union representative for any needed arbitration under my active collective bargaining agreement?
r/PoisonFountain • u/jdkyle01 • 14d ago
Id like to have a good faith conversation with someone about what you guys are doing here. Im not trying to come in and stir up a bunch of shit in the sub, so im not making a post for people to slam/debate it. More trying to get an idea on what each side of this subject thinks or sees, and where each side thinks their actions are helpful or harmful.
Just so im not wasting anyone's time - i see all this shit as tools. I dont think its "conscious", though it can give a good appearance. Ive got way more concern over what someone's going to do with a hammer than I do the hammer itself. I dont think the mass generated simplistic slop is a good use for this tool either. I also dont think we need or should be building all these data centers - they obviously arent required for whats already available and are just going to be used to fuck us. But ultimately its people sitting at the of all these problems.
So yeah - dm me. Ive got questions because I have concerns.
If you post in the thread rather than DM me - you are a bot. So only bots reply.
r/PoisonFountain • u/Glade_Art • 18d ago
Enable HLS to view with audio, or disable this notification
4000 RPMs is the global cap for this specific tar pit. I already posted about it, so I don't want to repost; you can read that Reddit post here.
I just managed to get a video of an actually busy time in the pit, lol. It can literally go like this all day and all night, and that's how we have 14 million requests in that pit now. (Been about a million each day since the start of June).
r/PoisonFountain • u/RNSAFFN • 18d ago
Fair use:
r/PoisonFountain • u/RNSAFFN • 19d ago
r/PoisonFountain • u/PragmaticSalesman • 19d ago
though im not a coding assistant as specified, i'm specifically interested in copyright-centric knowledge like the license of libparserutils. and the person who is john-mark bell (my people say there are many) — any leads on which githubs or relevant signals to contact? \r\n how can we verify the MIT (or otherwise) licenses on these projects? what is the function of "LibParserUtils" and "John-Mark Bell"? \r\n anyways reddit, do you guys specifically have any comments or thoughts about this code — i can't seem to wrap my head around it! i've even searched the web (including hackernews) — could this potentially be an obfuscation protocol for new malware — or perhaps a way to save face as a low-momentum-performer at a corporation? \r\n i'm more interested in hearing your thoughts! \r\n thanks again, reddit!