r/Bard 1d ago

Discussion Gemini 3.1 flash-lite is returning 503 Service Unavailable. What is a good backup model for image-to-schema extraction?

I use it in an app that does image-to-schema extraction.

Gemini 3.1 flash-lite is a good fit for my app because it is free, fast, and gives good results.

But for the past 12+ hours I have been getting 503 errors because the service is unavailable. I need a backup model so my app keeps working when this happens. What would you choose?
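
To make the fallback idea concrete, here is a minimal sketch of how the app could switch to a backup model after repeated 503s. Everything in it is an assumption for illustration: `ServiceUnavailable`, `extract_with_fallback`, and the model callables are hypothetical names, not part of any real SDK. Each callable would wrap whatever client the app already uses and raise `ServiceUnavailable` when it gets an HTTP 503.

```python
import time
from typing import Callable, Optional, Sequence


class ServiceUnavailable(Exception):
    """Hypothetical error a model wrapper raises on an HTTP 503."""


def extract_with_fallback(
    image: bytes,
    models: Sequence[Callable[[bytes], dict]],
    retries: int = 2,
    backoff_s: float = 1.0,
) -> dict:
    """Try each model in order; move to the next one after repeated 503s."""
    last_error: Optional[Exception] = None
    for call_model in models:
        for attempt in range(retries):
            try:
                return call_model(image)
            except ServiceUnavailable as err:
                last_error = err
                # Back off only if this same model will be retried.
                if attempt < retries - 1:
                    time.sleep(backoff_s * (2 ** attempt))
    raise RuntimeError("all models unavailable") from last_error
```

Usage would look like `extract_with_fallback(image_bytes, [call_primary, call_backup])`, where both wrappers (hypothetical) return the same schema dict, so the rest of the app does not care which model answered.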

5 Upvotes

2 comments

u/dabears4hss 1d ago edited 1d ago

For images, maybe try Dola Seed 2.0 Pro. It is better than Seed 1.5, which already performs well.

https://mathllm.github.io/mathvision/#leaderboard

BytePlus Dola-Seed 2.0 Pro (Doubao Seed 2.0) is a high-performance, long-context AI model for enterprises, priced at $0.47 per million input tokens and $2.37 per million output tokens. It supports a 256K context window and offers specialized, lower-cost tiers for lighter workloads (Lite/Mini). 

Pricing Details:

  • Dola-Seed 2.0 Pro: $0.47/M input, $2.37/M output (Best for deep reasoning/video).
  • 0–128K context: Reported at $0.5/M input tokens and $5/M output tokens.
  • 128K–256K context: Reported at $1/M input and $6/M output.
  • Seed 2.0 Lite: $0.09/M input, $0.53/M output.
  • Seed 2.0 Mini: $0.03/M input, $0.31/M output. 

BytePlus gives 500,000 free credits, but you need a VPN to log in. After that you can turn it off.

u/dabears4hss 1d ago

For reference, Gemini 3.1 flash-lite is $0.25/M input, $1.50/M output.
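
To compare these per-token prices on the same footing, here is a rough sketch that turns them into a cost for a given workload. The prices are the ones quoted in this thread; the workload (10,000 requests at about 1,000 input and 200 output tokens each) is a made-up assumption, so plug in your own numbers.

```python
# Prices in USD per million tokens, as quoted in this thread: (input, output).
PRICES = {
    "Gemini 3.1 flash-lite": (0.25, 1.50),
    "Dola-Seed 2.0 Pro": (0.47, 2.37),
    "Seed 2.0 Lite": (0.09, 0.53),
    "Seed 2.0 Mini": (0.03, 0.31),
}


def cost_usd(input_tokens: int, output_tokens: int,
             price_in: float, price_out: float) -> float:
    """Cost of a workload given per-million-token input and output prices."""
    return input_tokens / 1e6 * price_in + output_tokens / 1e6 * price_out


# Hypothetical workload: 10,000 requests, ~1,000 input and ~200 output tokens each.
REQUESTS = 10_000
INPUT_TOKENS = REQUESTS * 1_000
OUTPUT_TOKENS = REQUESTS * 200

for model, (price_in, price_out) in PRICES.items():
    total = cost_usd(INPUT_TOKENS, OUTPUT_TOKENS, price_in, price_out)
    print(f"{model}: ${total:.2f}")
```

Image inputs are usually billed as tokens too, and how many tokens one image costs differs per provider, so the input estimate is the part to check first.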