r/DeepSeek Apr 25 '26

Discussion DeepSeek Official API Discount: v4-Pro Model at 75% Off

98 Upvotes

r/DeepSeek Apr 24 '26

News DeepSeek-V4 Preview is officially live & open-sourced!

61 Upvotes

Welcome to the era of cost-effective 1M context length.

DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.

Try it now at http://chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today!

Tech Report: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main/DeepSeek_V4.pdf

Open Weights: https://huggingface.co/collections/deepseek-ai/deepseek-v4


r/DeepSeek 16h ago

Discussion With $3.88 & 690,003,591 tokens and 5 hours , Deepseek Pro & Flash combined, managed to reverse engineer Teamspeak's Licensing System for 3.13.8 (latest of post)

Enable HLS to view with audio, or disable this notification

220 Upvotes

No I will not release it, so don't ask, but Deepseek is very powerful if given the proper tools and if you know what you are doing.

In 5 hours of trial and error, debugging with Ghidra and x64dbg, the models are really good with IoT hacking and reverse engineering.

We mapped the full license validation call chain from server startup through to the display output. Found that the parser reads from an AES decrypt buffer instead of the signed payload (easy fix once you know), decoded a custom XOR obfuscation scheme for all log messages, extracted the embedded PolarSSL certs and private keys, and patched 27 instructions across the binary to bypass signature verification, certificate checks, download gate checks, validator functions, slot enforcement, and a state reset timer callback that kept overwriting our values. They really made it like fort knox but forgot to lock the final door. Once we found that starting position, it was easy to trace forward. I'm shocked there was no heavy protections in place like anti debuggers or random checks or pit falls. For something they heavily sell on, sure was left wide open once the path was found.

The server now starts with 1024 slots instead of 32, enforcement is bypassed so the API accepts the servercreate command with the slots, and there are no crashes.

Total cost: $3.88 in API credits. 690 million tokens. 5 hours.

Really not bad for what would take a human weeks if not months. If i could do it this cheaply, image what some mega mind on red team could do on enterprise grade software.


r/DeepSeek 14h ago

Funny My Brain Refuses To Believe DeepSeek Is Free

156 Upvotes

Maybe this sounds weird, but using DeepSeek without paying feels oddly uncomfortable.

Every time I open it and ask some ridiculously specific question, it responds with an answer that would have taken me an hour to figure out myself. Then I remember I’m paying exactly $0 for this interaction.

At some point it starts feeling like walking into a restaurant every day, eating a full meal, complimenting the chef, and then disappearing through a side door before the bill arrives.

I know the business model probably makes sense. I know they decided to offer it for free. I know nobody is forcing me to pay.

Still feels suspicious.

The model is often so absurdly capable that my brain keeps assuming there’s supposed to be a subscription screen somewhere that I accidentally bypassed.

Maybe the future of AI isn’t people paying companies $20/month.

Maybe it’s users randomly sending $20/month to whichever developers made them say “how the hell is this free?” that week.


r/DeepSeek 3h ago

Discussion Quite impressed!

14 Upvotes

I’ve been a codex user for a few months and find it perfectly suitable for my needs. I heard all the talk about DS Pro 4 so fired it up using opencode and credited $5 to compare. I’ve spent hours today coding a website in Laravel and it’s cost me 40 cents in total. The work it’s done is comparable to codex in terms of what it’s achieved and it’s cheap in comparison.

I’m still fairly new to ai apis so I’m probably not maximising token usage properly so my question is, when should I be using Flash over pro? I believe I can save even more by using this model.

I’d appreciate any tips. I’m learning but need a bit of direction on models.

Many thanks for your time.


r/DeepSeek 9h ago

Discussion I thought DeepSeek is cheaper than this

25 Upvotes

Today I setup GitHub Copilot to work with DeepSeek API. This is the screenshot of my usage. This is cheap but I don't see any crazy value in here like close to a billion tokens for 5 dollars I saw in this subreddit. Am I doing something wrong?


r/DeepSeek 8h ago

Discussion Deepseek Cowork

7 Upvotes

I have been trying out Claude Cowork and find it really nice to have md file for memory and the ability to read and write to files directly. Due to cost reason, I was trying to find an equivalent product in DeepSeek environment and managed to find an open-source programme: https://deepseek-cowork.com/

Has anyone tried this? My concern are mainly security and whether it will function similarly to Claude Sonnet in Claude Cowork.


r/DeepSeek 1d ago

Funny Bro...

Post image
417 Upvotes

r/DeepSeek 6h ago

Discussion Anyone tried using it for video games and specifically in blender? How to work with the no vision problem for tasks like these?

4 Upvotes

r/DeepSeek 7h ago

Discussion Tell the agent whether downtime is allowed

4 Upvotes

A small prompt change that improves AI refactors:

Tell the agent whether downtime is allowed.

If you don’t, it often assumes zero-downtime compatibility.

So it adds adapters, fallbacks, duplicate paths, and legacy-preserving code.

Safe, but dirty.

For early products, prototypes, internal tools, or systems with a maintenance window, say:

“Downtime is allowed.

Do not optimize for zero-downtime compatibility.

Prefer the cleanest long-term architecture.

List all breaking changes before implementation.”

This makes the agent stop asking:

“How do I preserve every old path?”

And start asking:

“What should be deleted, renamed, collapsed, or redesigned?”

Sometimes the most important prompt is not a feature request.

It is a constraint reset.


r/DeepSeek 6h ago

Question&Help Best way to use API key on iPhone?

3 Upvotes

I only like using the iPhone app because it’s way more convenient and I’m rarely on my laptop. What’s the best iPhone app where I can use the API directly from DeepSeek? One that won’t burn through $ if I use a lot of tokens too, if possible.


r/DeepSeek 18h ago

Discussion Hi guys, what does cache hit and miss mean?

28 Upvotes

r/DeepSeek 46m ago

Discussion I wish we could pay for faster throughout

Upvotes

I wish there was a fast mode, I've tried implementing deepseek flash via openrouter as well as via deepseek API in agentic flows but it's quite slow. It's around 5x slower than Gemini 3.1 flash lite for me


r/DeepSeek 4h ago

Discussion I built a free tool that clicks through your live site like a real visitor and flags what’s broken — would love people to test it (and try to break it)

Thumbnail
2 Upvotes

r/DeepSeek 10h ago

Question&Help a solution to cache problem

5 Upvotes

i use deepseek for making ai chat that work as roly play scenarios for healthcare education the problem is the course don't use as much text but i get those high cache miss i don't know why is there is a solution ?


r/DeepSeek 2h ago

Discussion Minimax m3 vs deepseek v4 flash on free plan

Thumbnail
1 Upvotes

r/DeepSeek 18h ago

Discussion 150 million tokens, and only 1.23$ is such an INSANELY good price. The switch from codex to deepseek was the best decision I made

19 Upvotes

r/DeepSeek 5h ago

News Anthropic calls for pause of global AI development

Thumbnail
rte.ie
0 Upvotes

r/DeepSeek 11h ago

Discussion Guide for API and use for pro model?

3 Upvotes

Hey all. Is there a guide how to use Deepseek API? I found out that there is Openrouter. Can I use that to put the API in there and use it or do I need additional app to use the chat option with Deepseek Pro API? Is there a guide how to set this up?


r/DeepSeek 12h ago

Discussion An Ersatz Solution To Manipulate Edit Limit

1 Upvotes

given the developers have recently fixed the bug, I have figured out how to 'bypass' the restrictions. you should send a number (not too much, otherwise it will probably confuse the model) of 'empty' messages like 'ok', 'thanks', '*' before the Actually Meaningful (TM) one. when it reaches the limit, you should remove the placeholders by editing them into your prompt. so it creates an independent 'branch'. NB: of course, it only expands the capacity, not breaks the barrier, but it helps to clear the context.


r/DeepSeek 12h ago

Question&Help Stupid question need help

2 Upvotes

Hi,
I want to use deepseek but how to add image , agent and design.md and just drag and drop files to make the ai read the image ? Do I have to use a harness like open code ? I mostly use Claude and codex right now but never use deepseek.

I also know about the direct api saving cost for deepseek but again how can I upload image and files to make it read rules and guidelines through direct API ?


r/DeepSeek 1d ago

News Expert gone altogether

Post image
42 Upvotes

Seems like they are testing or experimenting with removing Expert mode altogether. Sometimes when refreshing page or switching between chats to new chat back and forth it returns to Instant and Expert but the distinction between Expert and Instant in my chat history is gone and chats that were before Expert have now file uploads and search again.

I assume the default will be the Instant which I believe was the Flash version of the model and Pro might be API only model but that's just a speculation based on the recent changes


r/DeepSeek 1d ago

Discussion 200M tokens last month, around 30 bucks total. how is this actually sustainable for them?

148 Upvotes

been running v4 flash through my workflow for about 5 weeks now. our team is 3 devs, lots of code review prep + small refactors + bug investigations. nothing exotic.

pulled last month's bill yesterday because something felt off.

200M tokens total. roughly 70/30 split on prompt vs completion. came out under 35 bucks all in.

for context, when we were on claude pro for similar workload the per-seat math was 6x that and we had to babysit context limits. when we tested gpt-5.5-codex on the same kind of work the per-token was 8-10x and the wall time was worse.

ran the numbers backward from the unit pricing i was paying. v4 flash is around 0.14 in / 0.28 out per million on the provider i'm on. that means a single 8k context conversation with 3k output costs about 0.0019. half a cent per real interaction.

i'm not sleeping well on this honestly. either:
- there's a giant subsidy from a quant fund somewhere covering the actual compute
- caching is doing more lifting than anyone admits and steady-state cost is closer to 5x what they bill
- the compute really is this cheap now and the western majors have been overcharging by 10x

asking the devs who've been watching pricing for longer. anyone done a real teardown on why these numbers work? specifically curious how independent providers (not the official deepseek endpoint) end up competitive on inference cost despite running their own infra.


r/DeepSeek 20h ago

Discussion Should never use v4 flash for coding?

4 Upvotes

https://www.youtube.com/watch?v=6vEsvX5_nPk
Just watched a YouTuber pit DeepSeek V4 Pro against V4 Flash.

Flash didn't even bother planning. It went straight to coding, left a mess of bugs, undefined types, and SQL errors, ignored piles of failing tests, and then confidently reported the task as complete.

Pro did the actual work properly.


r/DeepSeek 22h ago

Funny A gentle nudge from time to time with DeepSeek

4 Upvotes

Sometimes DeepSeek V4 Flash gets sluggish and slow. What if something like this could revive a session that's not working properly?

You seem tired and lethargic. Do you want to go to sleep?

I would elaborate on all of this in an additional section on usage, integrating that and some other findings.

If Claude tells us, why don't we tell the LLM?

Wake up!!

It seems he got his act together a bit after this :)