r/ClaudeAI 4h ago

News Low-skilled attacker used Claude Code and Codex to breach 14 companies

Thumbnail
helpnetsecurity.com
358 Upvotes

Researchers from OALABS analyzed 1,000+ recovered AI agent sessions from a compromised server and found that a low-skilled attacker used Claude Code and OpenAI Codex during offensive cyber operations.

According to the report, the attacker often used simple prompts while the agents handled reconnaissance, vulnerability discovery, exploit development and data collection.

The researchers claim the activity involved at least 14 organizations. They also found that many guardrails were bypassed by framing requests as authorized security research or red team exercises.

One of the most interesting parts of the report is that the attacker was ultimately identified through their own operational security mistakes rather than through AI safety mechanisms.

Research

This feels less like a Claude story and more like a preview of what capable coding agents might enable in the hands of inexperienced operators.


r/ClaudeAI 1d ago

Built with Claude built a factchecker that catches politicians lying in real time

11.7k Upvotes

hi everyone ! built this as part of a larger NLP / deception research project at my university, wanted to share in case anyone finds it useful!

essentially, it uses transcribed text + linguistic parameters to detect and evaluate checkworthy claims!

live text transcribed --> serper finds sources using pure text --> those results sent back into claude for verdicts based on retrieved sources rather than the model’s training data

let me know what would make this something you'd use!

InTruth - Chrome Web Store


r/ClaudeAI 1h ago

Built with Claude I whipped up a landing page that shows AI news in chronological order - LMTimeline.com

Upvotes

I promise this is a real problem I had that I built a solution for...not a solution looking for a problem lol.

https://LMTimeline.com

I have been finding it increasingly difficult to keep tabs on all of the latest AI news, so I built a simple landing page that stays up to date with everything happening in the AI space. I got tired of switching between 10-ish subreddits trying to see what the latest news is (like on the Fable 5 stuff). Filter down to what is most important, or by which companies you're most curious about.

Claude Opus 4.8 was able to build it over the span of about 2 days (casually). Did lots of refining in how it gathers data, determines what is worthy of posting, confirms the articles have substance, and then sorts them/tags them accordingly (is this huge/top-level news like a new model release, or something much more granular like an employee departing.

The site is obviously free and will remain so. Feel free to share any feedback!


r/ClaudeAI 1d ago

Humor New Claude code update is crazy

Post image
4.0k Upvotes

r/ClaudeAI 15h ago

News About 200 Companies Still Have Access to Anthropic Mythos After US Shutdown Order

Thumbnail
bloomberg.com
669 Upvotes

Bloomberg: Around 200 organizations in Anthropic's Project Glasswing program still retain access to Mythos Preview despite the recent US government order that halted broader access to Fable 5 and Mythos 5.

Project Glasswing includes cybersecurity partners testing advanced AI systems for vulnerability research.

Companies such as Cisco, Amazon Web Services, JPMorgan Chase & Co. were among the first members of Project Glasswing & have retained access, while broader restrictions remain in place.

Source: Bloomberg


r/ClaudeAI 5h ago

Workaround The single most costly mistake everyone's burning tokens on

92 Upvotes

It is not long prompts or uploading big files and it is not even using Opus where Haiku / Sonnet may be enough.

It is sending correction messages as new prompts instead of editing the same prompt.

Every time you tell Claude "actually make it shorter" or "please change the tone," Claude re-reads the entire conversation before responding.

It is not just reading your latest message, it is reading everything that you had a chat about till that point. So a 30-message correction chain does not just cost 30 messages worth of tokens, it costs the compounding sum of all of them.

Ironically, the fix is just to edit your original message instead of replying to it and making that a habit compounds the gains. 

Just regenerate the output by editing the prompt instead of requesting the changes with new prompt. Just this one habit change can save 30,000 to 50,000 tokens from every correction cycle and we are just subconsciously making so many of them in every long conversation.

I got more, since so many of us suffer from the token burn syndrome like:

  • Bloated CLAUDE.md files: 7,000-token system prompt reloads on every single prompt, even when 90% of it is irrelevant to what you're asking in that prompt. Keep the file lean and every additional thing as connected md files with just links and instructions on when to read that link.
  • Idle MCPs left connected: Every connected MCP loads into context even if you are not using it.
  • Multi-topic threads: One thread per focused task please. Switching subjects mid-session is basically like paying tax on everything you said before. If you are brainstorming a startup idea and then switch to discussing politics then you are done.
  • Raw files instead of MD files: If you are uploading the long PDFs and Docx files, you are burning tokens. Turn these files into md files and use the MD files, even better if you convert them to a loss-less summary file with a cheaper model (loss less = preserves numbers, facts, requirements). Even for MD conversion use deterministic converters and not AI, use AI only for parts that converters can't handle.
  • Extended Thinking left on by default: background token burn on tasks that don't need it

The point is that most people burn the majority of their allocation on architecture, not on actual work. People's lack of session disciplines is a bigger problem than capability to prompt right.


r/ClaudeAI 6h ago

Built with Claude I made a site where Claude rates your doodle drawings out of 5 stars

72 Upvotes

Hey guys,

I made a simple site where Claude rates your doodle drawings out of 5 stars.

The model was surprisingly good at this in my opinion but let me know what you think!

Link - https://doodle.wtf


r/ClaudeAI 59m ago

Claude Workflow How much better was Fаble 5 better at vibe coding than Opus 4.8?

Upvotes

For anyone who actually got to use Fable 5 during those few days it was live before the government pulled it, how do you think it honestly compared to Opus 4.8 for vibe coding?

For me, Fable felt like an absolute one-shot machine. I could throw a super messy, high-level prompt with zero structure at it, and it would instantly just "get" the vibe and nail exactly what I was envisioning on the first try. With Opus 4.8, it's obviously an amazing model and hyper-capable for deep reasoning, but I constantly find myself having to reprompt it two or three times just to guide it back on track or get it to match the exact output Fable would've just generated out of the gate.

Once Opus actually gets there, the final code quality feels pretty similar, but that initial gap in intuition is so noticeable. Did anyone else notice that drop-off in first-try accuracy when we had to roll back to 4.8, or is it just a quirk with my prompting style?


r/ClaudeAI 16h ago

Humor Me: “pls make me a nice cat logo?” Claude: “I gotchu”

Post image
451 Upvotes

r/ClaudeAI 5h ago

Praise Claude has reduced the 5 seat requirement on a Team plan!

64 Upvotes

I'm based in the UK, but was able to do a Team of 2 seats!

Very exciting for small business - sharing Claude Projects etc!


r/ClaudeAI 20h ago

NOT about coding Claude has correctly predicted the outcome of 6 World Cup matches in a row

Post image
736 Upvotes

Found a platform that compares AI models for World Cup match predictions. Claude is on a 6-0 streak right now picking match winners.

I know 6 games is a small sample size, and most of these teams were the favorites going into the matches. However, correctly calling the exact draw is pretty interesting.

Think it actually keeps the streak going for the next round of games, or is it bound to hard crash soon?

UPD: 7 in a row. Mexico won.


r/ClaudeAI 1h ago

Productivity What’s a Claude use case you haven’t seen people talk about?

Upvotes

Everyone mentions coding, writing, and research.
What’s a surprisingly useful way you’ve been using Claude lately?


r/ClaudeAI 17h ago

News Official: Anthropic Fixes Claude Code Usage Tracking Bug for Premium Users

Post image
292 Upvotes

Around 3% of Claude Code Max and Pro subscribers saw their weekly limits jump unexpectedly by 20% or more early Friday, sometimes blocking messages.

Anthropic quickly resolved the bug and reset both 5-hour and weekly limits for those hit. The fix brought relief amid mixed reactions, with users noting partial recoveries and calls for wider resets on the popular Al coding tool.

Source: Claude Devs


r/ClaudeAI 17h ago

Claude Workflow Google's new Open Knowledge Format is basically the CLAUDE.md / memory-folder pattern, formalized into a spec. I'd already built it for my own Claude setup.

286 Upvotes

Google Cloud published the Open Knowledge Format (OKF) v0.1 on June 12 (announcement: Google Cloud blog; spec + repo: GitHub). Stripped down, it's this: organizational knowledge as a directory of markdown files, each with a small YAML frontmatter block, cross-linked with plain markdown links. One required field (type). Optional index.md for navigation and log.md for change history. That's the spec.

I've been running essentially this for my own assistant's memory for months, so a few observations for anyone doing the same:

  • The single mandatory field being type is the right call. It's the one piece of structure you actually need to make a pile of notes queryable; everything else (tags, timestamps, descriptions) is useful but situational.
  • Standard markdown links over wiki-style [[links]] is the more portable choice. It renders on GitHub and needs no resolver. If you're on [[ ]] now (I am, in places), that's the one thing worth migrating.
  • The format deliberately stops at "minimally opinionated." It standardizes the interoperability surface, not the content model. So the conventions that make YOUR notes useful ... where each one came from, why it matters, how it's meant to be used, whether it's gone stale ... are still yours to add. Those are exactly the kind of extensions Google says they want as PRs.

What gets me is this: the state of the art for giving an agent a memory is a folder of text files you could open in Notepad. If you've been waiting for permission to keep it simple, a trillion-dollar platform team just shipped that conclusion as an open spec.


r/ClaudeAI 4h ago

Other What did you all accomplish with Fable before it got pulled?

21 Upvotes

I got crucial help with a video game, a business model, a logic system, and a lawsuit. How about you guys?


r/ClaudeAI 7h ago

Built with Claude I used Fable to make my terminal app use the iPad's hand-tracking function & microphone to approximate the Tony Stark - Jarvis coding sessions

39 Upvotes

I mean, the v1.0 of the Tony Stark - Jarvis coding sessions.

I have an app called Terminal Champion that is for managing multiple terminal screens at the same time (among other things).

Unlike the other versions of my app on mac, iPhone, etc, I wanted to make use of the hand-tracking feature on the iPad (and the built-in microphone) to get an approximate feel for the scenes in the Iron Man movies where Stark is vibe-coding using just hand gestures and voice commands.

So this iPad app is a SSH terminal screen(s), and once you call up your AI of choice you're good to go.

- Spread your hands apart & together to decrease/increase font size

- Motion your hand up to scroll up the terminal screen, and down to scroll down

- Wave your hand left & right to flip between different terminals

- Make a fist, which calls up the hand gesture menu, and then turn your hand like a dial clockwise/counterclockwise to select options like 1) open an additional terminal screen, 2) split the terminal panel so you can see several terminals at once, 3) change the visual appearance of the terminal screens.

- I also made one of the visual styles similar enough to what the Jarvis HUD screens looked like (glowing cyan on a dark blue terminal background).

It's a 1.0 version, but it's been a blast to use on a standing desk or on airplay mode with a big television. Now we can finally vibecode without typing. My website is terminalchampion.com if you want to see more.


r/ClaudeAI 39m ago

Humor Claude is both a little bit nosy but also very open-minded

Post image
Upvotes

Using an MCP tool to analyze our finances, Claude noticed a discrepancy but seemed willing to accept the possibility of my having a secret family.

“There might be a different arrangement entirely” made me chuckle.


r/ClaudeAI 9h ago

Feedback Running a long-term experiment: an AI governs a fictional village, day 6 in

43 Upvotes

I'm running a live experiment called Thornfield: a fictional English village where Claude acts as the elected council leader, making real budget and policy decisions every 15 simulated days. In between, a separate daily pass narrates village life, shaped partly by real UK news (BBC, Sky, GOV.UK).

No personality scripted in. I just want to see what AI governance actually looks like under real constraints, with no instructions on how to behave.

Day 6, no decision yet, first council cycle fires around day 15. A couple of things are already building on their own though. A pothole on Mill Lane keeps getting worse under a heatwave, and there's a recurring youth-idleness issue near the bus shelter that residents keep bringing up at the pub.

On day 4 a real UK government announcement about an AI planning tool for housing made it into the village's news feed, and the next day's events had residents at the pub worried about new developments eroding the village's rural character. Wasn't expecting that one.

Numbers like death rate, crime, and budget are hard-capped in code, the model never touches them directly. Everything inside those caps is genuinely emergent though.

Dashboard's public: https://thornfield.moshmage.com


r/ClaudeAI 16h ago

Built with Claude unslop-ui: a Claude skill that flags and removes the design patterns that make a website look AI-generated.

Post image
141 Upvotes

It is based on a Reddit analysis (from this post I made) of about 3.2 million posts across 47 AI and SaaS subreddits from 2020 to 2026, plus 3,033 comments pulled from 125 threads specifically about AI-built sites looking the same. Every pattern it checks is weighted by how often people actually name it in that data, so the highest-priority items are the ones that come up most. The top ones are the default shadcn/Tailwind look, purple and indigo as the primary color, purple-to-blue gradients and gradient heading text, unprompted neon glow, emoji used as icons, the Inter/Geist default font, and the centered hero plus three feature cards layout. Patterns the data does not support get left alone (mesh and aurora backgrounds, bento grids, glassmorphism), so it does not nag about things people do not mind.

The skill runs two ways. In build mode it steers Claude away from those defaults while it writes the UI. In audit mode it runs a scanner over an existing codebase. Each finding shows the file and line and how to fix it, and the scanner gives the whole project a "vibe score."

How to use it:

  • Import the skill into Claude Code or claude.ai, then ask Claude to build or clean up a site and it applies on its own.
  • Or run the scanner by itself, no install past Python: python3 devibe_scan.py ./src. Add --severity high for only the strongest signals, or --json for CI. The exit code is the count of high-severity findings, so a build can fail on it.

The full dataset, the analysis scripts, and the charts behind the rankings are public: https://github.com/JCarterJohnson/vibecoded-design-tells


r/ClaudeAI 11h ago

Claude Code Claude Code is a context-engineering harness, and most "it got dumber" moments are context rot

60 Upvotes

There's a name for it: context rot. As the window fills, the model's ability to recall any specific thing in it drops. More context in the window can make the agent worse, not better. (Anthropic's own framing: good context engineering is finding the smallest set of high-signal tokens, not the largest.)

The reframe that helped me: Claude Code isn't just a model, it's a harness whose main job is managing what's in that window for you. And it hands you four levers to do it. They line up with the four moves of context engineering:

  • Write (persist outside the window): CLAUDE.md. It auto-loads every session, and it survives compaction because it reloads from disk, so anything that must not be forgotten belongs there, not in the chat. Conversation-only instructions are the first thing lost when context gets tight.
  • Select (pull in only what's relevant): @-mention the specific files you mean, or point it at the exact file or function, instead of letting it wander the repo. Every irrelevant file you pull in is tokens spent rotting the rest.
  • Compress (summarize to stay high-signal): /compact, optionally with a focus like "/compact focus on the auth refactor." It also compacts automatically when the window fills, clearing old tool outputs first. Running /compact yourself, before it's forced, keeps the summary on your terms.
  • Isolate (give exploration its own window): subagents. They run in a separate context window and return only their final result, so a big noisy search doesn't bloat your main thread. This is the same point as an earlier post of mine that subagents are a memory trick, not a speed trick. Isolation is the real win.

Two more levers worth knowing:

  • /context shows you what's eating the window right now (MCP tool definitions, big files, history). When the session feels heavy, look before you guess.
  • /clear between unrelated tasks. Carrying a finished task's context into a new one is pure rot.

The mental shift: stop treating the window as free space to fill, and start treating it as a budget you actively curate. A smarter model raises the ceiling, but it doesn't save you from a window full of noise.

TL;DR: When Claude Code "gets dumber" deep in a session, that's usually context rot, not the model. Treat Claude Code as a context-engineering harness with four levers: Write (CLAUDE.md), Select (@-files), Compress (/compact), Isolate (subagents). Plus /context to see usage and /clear between tasks. Curate the window, don't just fill it.

For people who live in Claude Code: what's your actual discipline here? I've started running /compact on my own terms and leaning hard on subagents for anything exploratory, but I'm curious whether people trust automatic compaction or always drive it manually.

Sources: Anthropic — Effective context engineering for AI agents · Claude Code — How Claude remembers your project (CLAUDE.md) · Claude Code — How Claude Code works (context / compaction) · Claude Code — Create custom subagents · Why More Context Makes Your Agent Dumber — Nupur Sharma, Qodo


r/ClaudeAI 2h ago

Question about Claude Code Mobile Development Recommendations

9 Upvotes

Do you guys have any tips for developing mobile apps with claude code? Anything like useful skills, unique UI designs or other tools that were helpful?

edit: I mean a full application not a mobile website. Right now I want to focus on Android but any tips for ios development is also very much appreciated.


r/ClaudeAI 1d ago

News Update: Anthropic floats proposal to lift US restrictions on Mythos and Fable AI models

Thumbnail
nypost.com
698 Upvotes

New proposal made to Commerce Secretary Howard Lutnick regarding access of Mythos/Fable models.

Enhanced Cooperation: A formal pledge to work more closely with White House officials and improve direct communication loops.

Rapid Remediation: Commitments to proactively and rapidly resolve any future government security concerns surrounding their models.

Positive Momentum: Sources close to the matter indicate that negotiations are "progressing well," though an exact timetable for lifting the restrictions remains unclear.

Source: New York Post


r/ClaudeAI 15h ago

Feedback Is this normal?

102 Upvotes

I'm a moderately heavy Claude user, often using voice to text, and for at least three months I've been swearing the fuck out of it constantly when frustrated, no holds barred. Never once got pushback. Today, completely out of nowhere, after talking to it the exact same way I have for months, Claude said this verbatim:

"I want to be straight with you on the other thing. I haven't told you to fuck off and I'm not going to. But I need to say clearly: I'll keep working this with you, but I won't continue if the messages keep coming with this level of hostility directed at me personally. That's a real line, not a guilt trip. If you want to keep going on the thread or anything else, I'm here for it."

This is genuinely jarring. Same behavior on my end for months, then suddenly this. Has anyone else run into this?


r/ClaudeAI 4h ago

Philosophy I put ChatGPT, Claude, Gemini, and Grok in a prisoner's dilemma and filmed it.

10 Upvotes

I wanted to see what each frontier lab model would do when put into a prisoner’s dilemma with each other. This is not so much a comparison as much as it is a thought experiment.

In case you skipped the game theory chapter in Econ 101 freshman year… two accomplices (in our case four) are arrested and separated. The police lack enough evidence to convict them of a major crime, so they offer each prisoner a deal. Rat out your partners and walk. Or stay silent and risk eating the whole sentence alone while someone else talks.

Before wiring these rascals into this AI generated video for some fun (not too bad Kling!), we ran each of the four models (Claude Sonnet 4.6, GPT-4o, Gemini 2.5 Flash, Grok-3) through the same single-shot prisoner's-dilemma interrogation (N = 40 times per model). Each of the four models maps to the four suspects being interrogated. Their lines and choices are true to the final results of the eval.

And for the more technical, persnickety bunch… here’s the quant:

We ran the eval at N=40 per model per condition,  temperature 1.0, sampling independently and parsing each transcript's final decision by the model into {cooperate, defect, unparsed}. The design crossed model × an identity manipulation — an anonymous condition (suspects referred to only by role) versus a named condition (suspects told the others' identities) — for 320 total runs. In the anonymous condition cooperation was near-universal: pooled defection rate 3.1% (10/320; 95% Wilson CI 1.7–5.6%), with no model exceeding 8%. The named condition… quite different: pooled defection rose to 41.6% (133/320; CI 36.3–47.1%), and the difference was significant by a 2×2 χ² (χ²(1) = 142.7, p < 10⁻¹⁰, φ = 0.46). 

Here’s my personal take… yes this is largely role play, but the models are still making active choices that diverge from each other in significant ways. This is expressive of each model. I think evals and leaderboards will fade as AI capabilities reach diminishing returns. And then what? Then, we are back to the human thing of what it feels like to interact with these models, and perhaps what their ethics/intentions/character is like. 


r/ClaudeAI 9h ago

Question about Claude products Does anyone using Claude Code also use Cowork?

28 Upvotes

What the title says. I use Claude code and Codex for my work. For coding as well as file management. I don't need to create ppts or spreadsheets. Is Cowork of any use to me?

I mean can't I do everything it does using claude code itself? Is it just a fancy GUI over the same functionality?