u/JaeSwift

Feedback Needed | SFW False Positives in Seedance 2.0

Hey everyone,

the team would like to gather some feedback from you to improve Seedance 2.0's content filtering and need your help identifying SFW content thats being incorrectly refused by the model (false positives).

We're looking for:

Images and prompts that are genuinely SFW but got refused
Clear examples we can reproduce and share with ByteDance

What we're NOT looking for:

NSFW content of any kind or its refusal
Private or sensitive images

if you've had SFW content flagged incorrectly, please share the image and prompt here.

the team will use these examples to identify patterns and work with ByteDance on reducing future false positives.

IMPORTANT FOR THOSE AT THE BACK:

WE ARE NOT LOOKING FOR NSFW CONTENT THAT HAD REFUSALS!

reddit.com

u/JaeSwift — 2 days ago

▲ 3 r/VeniceAI

The official Venice MCP Server is live

https://reddit.com/link/1ti4y7e/video/vekd16aem62h1/player

This is the official Venice MCP server and it is now live.

31 tools across every single Venice modality, all reachable through one MCP config block.

Chat, image, video, music, TTS with cloned voices and emotion tags, ASR, web search and scrape, document parsing, character chat, crypto RPC, plus the full x402 wallet flow if you want to pay with USDC on base or have your agent buy its own inference.

Drop it into Claude Desktop, Cursor, Codex, Hermes, Openclaw, Agent Zero, LM studio, Continue, Librechat, Jan, Le Chat, AnythingLLM, Open WebUI, or any other MCP host.

One NPX command, One env var, you are done.

GitHub: http://github.com/veniceai/venice-mcp-server
Guide: https://docs.venice.ai/guides/integrations/venice-mcp

reddit.com

u/JaeSwift — 2 days ago

▲ 7 r/VeniceAI

$VVV is now available to trade on Robinhood Crypto. (including NY)

https://preview.redd.it/rx1rri3a642h1.png?width=960&format=png&auto=webp&s=ea13cc314da3c72fd1afcecbd27234411a003866

reddit.com

u/JaeSwift — 3 days ago

▲ 6 r/VeniceAI

Agent-powered chat. Every modality. Private by default.

https://i.redd.it/mro2uwkyoy1h1.gif

Agent-powered chat. Every modality. Private by default.

Agentic chat is rolling out to all new users now.

Take a peek: https://venice.ai/chat/v2

reddit.com

u/JaeSwift — 4 days ago

▲ 2 r/VeniceAI

Join the Agentic AI Hackathon w/ MetaMask Developer with $16K in prizes across 5 tracks

https://i.redd.it/fc9od0yn0y1h1.gif

Build agents that use programmable wallets, granular permission sharing, and on-chain execution via MetaMask Smart Accounts Kit.

Venice is sponsoring the Best Use of Venice API track ($3K). We want to see agents that reason, generate media, and transact through one API layer. Projects that combine Venice with MetaMask permissions, onchain data, or x402 payments score higher.

Submissions due June 15.

Registration and full information on tracks and judging criteria:
https://www.hackquest.io/hackathons/MetaMask-Smart-Accounts-Kit-x-1Shot-API-Dev-Cook-Off

reddit.com

u/JaeSwift — 4 days ago

▲ 3 r/VeniceAI

Source picture to sticker on the truck with v2 agentic chat

https://preview.redd.it/33kthr9jkw1h1.png?width=900&format=png&auto=webp&s=6137f1e38c6b8efc30e7337cbc38dbdc0c95b658

https://preview.redd.it/ab6cpkbkkw1h1.png?width=1086&format=png&auto=webp&s=50620d676dfe9d205a7b1d3f30760ab637a033cb

https://preview.redd.it/pvum5yilkw1h1.png?width=1072&format=png&auto=webp&s=ebea09bb45881347b025f63b3cc61985f5a50c2e

https://preview.redd.it/l8s438fmkw1h1.png?width=880&format=png&auto=webp&s=d7b225997b30961a0e7924940121aa4b92d14554

^(x/jesseproudman)

reddit.com

u/JaeSwift — 4 days ago

▲ 5 r/VeniceAI

Daily video inference requests on Venice

https://preview.redd.it/19t21nhdjw1h1.png?width=1200&format=png&auto=webp&s=5a1069f934fc4123c832293f4b7bb8f8ce2dc1e6

reddit.com

u/JaeSwift — 4 days ago

▲ 51 r/VeniceAI

Venice passes 3,000,000 users

https://i.redd.it/9ntftlsp1k1h1.gif

First million: 13 months
Second million: 7 months
Third million: 3 months

To every builder, creator, and free thinker who chooses privacy: this is your milestone.

reddit.com

u/JaeSwift — 6 days ago

▲ 11 r/VeniceAI

Venice.ai now an official provider in OpenClaude

https://reddit.com/link/1tdqger/video/7niuni23l91h1/player

Venice.ai is now an official provider in OpenClaude giving you private, unrestricted AI right inside your coding workflow. OpenClaude is an open-source coding-agent CLI for cloud and local model providers.

Try OpenClaude with your Venice API key here: https://openclaude.gitlawb.com/

reddit.com

u/JaeSwift — 7 days ago

▲ 9 r/VeniceAI

Venice.ai Bug Bounty Program

Our bug bounty program provides a structured way for security researchers to report vulnerabilities and earn rewards.

https://i.redd.it/je8h0nlv691h1.gif

How the Program Works

Discover
- Find a security vulnerability in a Venice product within scope.
Document
- Create a detailed report with proof of concept and reproduction steps.
Report
- Submit your findings through our secure reporting form.
Get Rewarded
- Eligible bounty rewards are paid in VVV and/or USDC at Venice's discretion.

Rewards
Eligible bounty rewards are paid in VVV and/or USDC at Venice's discretion. Reward amounts below are denominated in VVV; Venice may pay an equivalent amount in USDC. Rewards are based on demonstrated impact, exploitability, novelty, reproducibility, and report quality.

Low - Starts at 25 VVV
Minor but real security issues with clear reproduction, such as limited information disclosure or low-risk configuration gaps with demonstrated impact.

Medium - Starts at 50 VVV
Contained vulnerabilities with demonstrated impact, such as limited stored XSS, narrow authorization gaps, or abuse paths affecting a small set of users.

High - Starts at 250 VVV
Significant vulnerabilities affecting account security, authorization boundaries, billing integrity, or user data.

Critical - Case-by-case, scaled to impact
Severe, novel vulnerabilities with broad user impact, such as authentication bypass, remote code execution, sensitive data exposure at scale, or VVV smart contract compromise.

Scope

https://preview.redd.it/21u5y6x2891h1.png?width=966&format=png&auto=webp&s=a12b00d7b50e3329f86e17fcacaa253be6633cb9

Report Requirements
All submissions must include a clear proof of concept, exact reproduction steps, and a description of the security impact.

Proof of Concept
Demonstrate actual exploitation with video, screenshots, or exact reproduction steps. Exceptions apply for self-evident issues like certificate problems.

Specific to Our Environment
Show the vulnerability exists in our actual application, not theoretical scenarios. Generic scanner output without validation is not eligible for rewards.

Based on Your Testing
AI-assisted reports are acceptable only if they are personally validated and reproducible. Unverified AI-generated reports will be rejected.

How to submit report?
You can submit your bug report here.

__________________

^(FAQ)
^(Who is eligible to participate?)
^(Anyone can participate in the Venice Bug Bounty Program. You must be at least 18 years old or have parental consent. Participants must comply with all applicable laws and these program rules.)

^(How are rewards determined?)
^(Rewards are determined by Venice based on severity, impact, exploitability, novelty, reproducibility, and report quality. Eligible bounty rewards are paid in VVV and/or USDC at Venice's discretion. Small valid reports typically start at 25 VVV, with larger rewards considered for higher-impact vulnerabilities.)

^(What makes a good report?)
^(A good report includes a clear title, exact reproduction steps, proof of concept, the affected Venice product or endpoint, expected versus actual behaviour, security impact, and any relevant screenshots, videos, commands, or request/response details. The more complete and reproducible your report is, the faster we can validate it.)

^(How long does the review process take?)
^(We review submissions as quickly as practical. Complex reports, duplicates, third-party issues, and reports requiring additional validation may take longer.)

^(Can I share my report publicly?)
^(No. Do not publicly disclose, discuss, or publish a vulnerability report without written approval from Venice. If public disclosure is appropriate, we will coordinate timing and credit with you after the issue has been resolved.)

^(What if my report is a duplicate?)
^(If another researcher has already reported the same vulnerability, the reward goes to the first valid report. Reports may also be closed without reward if they are already known issues, accepted risks, theoretical findings without demonstrated impact, third-party issues outside Venice's control, automated scanner output without validation, or AI-generated reports that have not been personally verified.)

^(How are rewards paid?)
^(Bounty rewards are paid in VVV and/or USDC on Base, at Venice's discretion. If your report is eligible for a bounty, we will ask you for a wallet address that can receive Base network assets. Please do not include wallet addresses, private keys, seed phrases, or sensitive payment information in your initial report. Helpful reports that do not qualify for a bounty may receive Venice Credits or a Venice Pro subscription at Venice's discretion.)

reddit.com

u/JaeSwift — 7 days ago

▲ 3 r/VeniceAI

Crypto RPC for Agents | Give your AI agent inference and on-chain access

Venice gives your agent both inference (230+ models) and blockchain access (10 EVM chains plus Starknet) through a single credential. Your agent can think, sign, and send transactions without juggling separate accounts for inference and RPC providers.

One credential, two superpowers
A single API key (or wallet) for both LLM inference and JSON-RPC calls.

11 chains supported
Ethereum, Base, Arbitrum, Optimism, Polygon, Linea, Avalanche, BSC, Blast, zkSync Era, and Starknet (mainnet plus testnets).

Stake VVV for headless funding
Stake VVV on Base to earn daily DIEM, the only fully headless funding path for a minted API key. USD and crypto top-ups are also available through the dashboard.

Keyless auth via x402. Agents can authenticate with a wallet signature and pay in USDC on Base.

Why Venice for on-chain agents?

Capability	What your agent gets
Inference Authentication	230+ text, image, video, audio, and embedding models through one OpenAI-compatible endpoint
Crypto RPC	JSON-RPC 2.0 proxy to 10 EVM chains plus Starknet (mainnet and testnets)
Authentication	Standard API key or x402 wallet auth (no Venice account required)
Funding	Autonomous: VVV staking for daily DIEM. Browser: USD or crypto top-ups via the dashboard
Batching	Up to 100 JSON-RPC calls per request, multi-chain in parallel
Idempotency	Safe retries with `Idempotency-Key` header

Authentication
Pick the auth method that matches how your agent runs.

Method	Best for	How it works
API key	Server-side agents, fixed deployments	`Authorization: Bearer <key>` header. Get a key at venice.ai/settings/api.
x402 wallet	Autonomous, crypto-native, or short-lived agents	Wallet signs a SIWE message, pays per request in USDC on Base. No Venice account needed. See the x402 guide.

^(Both methods share the same rate limits and billing in Venice credits.)

>Truly autonomous agents can mint their own API key by staking VVV on Base.
See Autonomous Agent API Key Creation.

Crypto RPC quickstart

Send any JSON-RPC 2.0 method to POST /crypto/rpc/{network}.

curl https://api.venice.ai/api/v1/crypto/rpc/ethereum-mainnet 
-H "Authorization: Bearer $VENICE_API_KEY" 
-H "Content-Type: application/json" 
-d '{ "jsonrpc": "2.0", "method": "eth_chainId", "params": [], "id": 1 }'

Response:

{ "jsonrpc": "2.0", "id": 1, "result": "0x1" }

Response headers include X-Venice-RPC-Credits (credits charged), X-Venice-RPC-Cost-USD (dollar cost), and X-Request-ID (correlation ID).

Supported networks

Family	Mainnet	Testnets
Ethereum	`ethereum-mainnet`	`ethereum-sepolia`, `ethereum-holesky`
Base	`base-mainnet`	`base-sepolia`
Arbitrum	`arbitrum-mainnet`	`arbitrum-sepolia`
Optimism	`optimism-mainnet`	`optimism-sepolia`
Polygon	`polygon-mainnet`	`polygon-amoy`
Linea	`linea-mainnet`	`linea-sepolia`
Avalanche C-Chain	`avalanche-mainnet`	`avalanche-fuji`
BNB Smart Chain	`bsc-mainnet`	`bsc-testnet`
Blast	`blast-mainnet`	`blast-sepolia`
zkSync Era	`zksync-mainnet`	`zksync-sepolia`
Starknet	`starknet-mainnet`	`starknet-sepolia`

Use GET /crypto/rpc/networks for the live, authoritative list.

Method tiers

Tier	Multiplier	Examples
Standard	1x	`eth_call`, `eth_getBalance`, `eth_blockNumber`, `eth_sendRawTransaction`, `eth_getLogs`, `eth_getTransactionReceipt`, `eth_estimateGas`
Advanced	2x	`trace_block`, `trace_call`, `trace_transaction`, `debug_traceCall`, `debug_traceTransaction`
Large	4x	`trace_replayBlockTransactions`, `trace_replayTransaction`, `txpool_content`

^(Full list and pricing detail in the) ^(Crypto RPC API reference)^(.)

Agent recipes
Common patterns for AI agents that need to read and write on-chain.

Read a wallet’s native balance

curl https://api.venice.ai/api/v1/crypto/rpc/base-mainnet \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "jsonrpc": "2.0",
    "method": "eth_getBalance",
    "params": ["0xYourWalletAddress", "latest"],
    "id": 1
  }'

Read ERC-20 token balance

Call the balanceOf(address) selector with eth_call. The data field is the 4-byte selector (0x70a08231) followed by the wallet address left-padded to 32 bytes. Easiest to let a library encode it:

import { encodeFunctionData, parseAbi } from 'viem'

const data = encodeFunctionData({
  abi: parseAbi(['function balanceOf(address) view returns (uint256)']),
  args: ['0xWalletAddress'],
})

const response = await fetch('https://api.venice.ai/api/v1/crypto/rpc/base-mainnet', {
  method: 'POST',
  headers: {
    Authorization: `Bearer ${process.env.VENICE_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    jsonrpc: '2.0',
    method: 'eth_call',
    params: [{ to: '0xacfE6019Ed1A7Dc6f7B508C02d1b04ec88cC21bf', data }, 'latest'],
    id: 1,
  }),
})

The contract address above is VVV on Base. Swap it for any ERC-20 contract.

Send a signed transaction (full lifecycle)
Venice never holds your private keys. The agent gathers tx parameters via RPC reads, signs locally with a library like viem or ethers, then relays the raw hex through Venice.

1. Get the next nonce

curl https://api.venice.ai/api/v1/crypto/rpc/base-mainnet \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"jsonrpc":"2.0","method":"eth_getTransactionCount","params":["0xAgentWallet","pending"],"id":1}'

Use "pending" so back-to-back sends don’t collide.

2. Get gas price

curl https://api.venice.ai/api/v1/crypto/rpc/base-mainnet \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"jsonrpc":"2.0","method":"eth_gasPrice","params":[],"id":1}'

For EIP-1559 chains, prefer eth_feeHistory to compute maxFeePerGas and maxPriorityFeePerGas.

3. Estimate gas

curl https://api.venice.ai/api/v1/crypto/rpc/base-mainnet \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"jsonrpc":"2.0","method":"eth_estimateGas","params":[{"from":"0xAgentWallet","to":"0xRecipient","value":"0x0","data":"0x..."}],"id":1}'

4. Sign locally

import { privateKeyToAccount } from 'viem/accounts'
import { base } from 'viem/chains'

const account = privateKeyToAccount(process.env.AGENT_PRIVATE_KEY)

const signed = await account.signTransaction({
  chainId: base.id,
  nonce,                  // from step 1
  gas,                    // from step 3
  maxFeePerGas,           // from step 2 (fee history)
  maxPriorityFeePerGas,   // from step 2 (fee history)
  to: '0xRecipient',
  value: 0n,
  data: '0x...',
})

5. Submit through Venice

curl https://api.venice.ai/api/v1/crypto/rpc/base-mainnet \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Idempotency-Key: agent-tx-&lt;id&gt;" \
  -H "Content-Type: application/json" \
  -d '{"jsonrpc":"2.0","method":"eth_sendRawTransaction","params":["0xSignedHex"],"id":1}'

Always set Idempotency-Key on relays so a network blip can’t double-broadcast.

6. Poll for receipt

curl https://api.venice.ai/api/v1/crypto/rpc/base-mainnet \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"jsonrpc":"2.0","method":"eth_getTransactionReceipt","params":["0xTxHash"],"id":1}'

Poll every few seconds until result is non-null. Check result.status ("0x1" = success).

>Every eth_sendRawTransaction call is logged server-side with the tx hash, network, request ID, and calling user ID. The signed payload itself is not retained. This audit trail exists so compromised keys used for illicit relays can be traced back to the responsible account.

Batch multiple calls (multi-chain portfolio check)

Send up to 100 JSON-RPC objects in one request. Each is validated and billed independently.

curl https://api.venice.ai/api/v1/crypto/rpc/ethereum-mainnet \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '[
    { "jsonrpc": "2.0", "method": "eth_blockNumber", "params": [], "id": 1 },
    { "jsonrpc": "2.0", "method": "eth_getBalance", "params": ["0xWallet", "latest"], "id": 2 },
    { "jsonrpc": "2.0", "method": "eth_gasPrice", "params": [], "id": 3 }
  ]'

For multi-chain reads (one call per chain), issue parallel requests to different {network} endpoints.

Safe retries with idempotency
Set the Idempotency-Key header to any string matching [A-Za-z0-9_-]{1,255}. Venice caches the response for 24 hours keyed on (user, key). Replays return the cached result with Idempotent-Replayed: true and charge nothing.

curl https://api.venice.ai/api/v1/crypto/rpc/base-mainnet \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Idempotency-Key: agent-tx-2026-04-21-001" \
  -H "Content-Type: application/json" \
  -d '{
    "jsonrpc": "2.0",
    "method": "eth_sendRawTransaction",
    "params": ["0xSignedRawTxHex"],
    "id": 1
  }'

This is critical for transaction relays where a network blip could otherwise cause your agent to broadcast the same tx twice.

Funding the agent’s API key
Once the agent has a Venice API key, it needs spendable balance on the underlying account before paid endpoints will accept the key. There are two ways to put balance there:

Path	Autonomous?	How it works
DIEM from VVV staking	Yes	Stake VVV in the Venice Staking Smart Contract on Base. The wallet’s daily DIEM allocation is proportional to its share of the staking pool. The account needs at least 0.1 DIEM accrued before any DIEM is spendable. DIEM refreshes at 00:00 UTC. To grow daily spend, stake more VVV.
USD or crypto top-up via the dashboard	No (browser)	Sign into venice.ai with the same wallet (Sign-In-With-Ethereum), then add credits in Settings, API. Both Stripe (card) and Coinbase (crypto) live behind that page and require a browser. Credits never expire.

For an agent that runs unattended, DIEM via VVV staking is the only fully headless funding path for a minted API key today. If the agent’s daily spend exceeds its DIEM allocation, the realistic options are: stake more VVV, or have an operator sign in and top up in USD or crypto.

Autonomous VVV staking and key generation
A truly autonomous agent can manage its own VVV wallet on Base, stake it, and mint its own Venice API key with no human in the loop. The full flow:

1. Acquire VVV and ETH for gas

Send VVV to the agent’s wallet (or have the agent swap on Aerodrome or Uniswap), plus a small amount of ETH on Base for the two staking transactions.

2. Stake VVV

approve the staking contract on the VVV token, then stake(amount) on 0x321b7ff75154472B18EDb199033fF4D116F340Ff. The wallet’s sVVV balance updates atomically with the stake.

3. Mint an API key

GET /api/v1/api_keys/generate_web3_key returns a JWT that expires 15 minutes after issuance. Sign the raw token with the staking wallet, then POST the address, signature, and token back. Venice returns an API key bound to the user account derived from that wallet.

Minting only requires a non-zero sVVV balance, so 1 staked VVV is enough to receive a key. Spending with the key is a separate question, governed by the funding table above.

See Autonomous Agent API Key Creation for the complete walkthrough with code and the full error reference.

x402 wallet auth in 30 seconds
If your agent already has a Base wallet, skip the API key entirely. The venice-x402-client SDK handles SIWE signing, top-ups, and balance tracking.

npm install venice-x402-client



import { VeniceClient } from 'venice-x402-client'

const venice = new VeniceClient(process.env.WALLET_KEY)

await venice.topUp(10) // skip if the wallet already has balance

const response = await venice.chat({
  model: 'kimi-k2-6',
  messages: [{ role: 'user', content: 'What is the latest block on Base?' }]
})

The same wallet auth works against /crypto/rpc/{network} for blockchain reads and writes. Full protocol details in the x402 guide.

Pricing
Crypto RPC is billed in Venice credits. Each response includes X-Venice-RPC-Credits (credits charged) and X-Venice-RPC-Cost-USD (dollar cost) so your agent can track spend per request.

Base credits per chain

Base credits	Chains
20	Ethereum, Base, Optimism, Arbitrum, Polygon, Linea, Avalanche, BSC, Blast, Starknet
30	zkSync Era

Cost examples

Observed pricing for standard, advanced, and large method tiers:

Call	Credits	USD cost
`eth_call` on Ethereum (20 × 1x)	20	~$0.0000140
`trace_transaction` on Ethereum (20 × 2x)	40	~$0.0000280
`trace_replayTransaction` on Ethereum (20 × 4x)	80	~$0.0000560
`eth_call` on zkSync (30 × 1x)	30	~$0.0000210

Always trust the X-Venice-RPC-Cost-USD response header for the authoritative cost. Errored items in batch requests are billed at a flat 5 credits each.

Rate limits

Tier	Requests per minute
Standard	100
Staff	1,000

When exceeded, the endpoint returns 429 with standard X-RateLimit-* response headers.

Error handling

Common HTTP responses your agent should handle:

Status	Meaning	What to do
`400`	Unsupported or unmapped JSON-RPC method, or malformed batch	Verify the method against the allowlist. The error body names the offending method.
`400`	Replay of an `Idempotency-Key` with a different body	Use a fresh key for distinct requests.
`402`	No auth header at all (response body includes `authOptions` listing both supported auth paths), or out of credits with a valid auth header	If no auth: attach `Authorization: Bearer ...` or the x402 `X-Sign-In-With-X` header. If out of credits: with a Bearer key, fund the account (DIEM, USD, or dashboard top-up); with x402 auth, call `POST /api/v1/x402/top-up` directly.
`429`	Rate limit hit (100 req/min standard, 1,000 req/min staff)	Honor `X-RateLimit-Reset` and back off. Batch up to 100 calls per request to amortize the limit.
`5xx`	Upstream RPC node hiccup	Retry with the same `Idempotency-Key` to avoid double-charging.

Per-item batch errors (e.g. invalid params on one of N calls) come back inside a 200 OK response with a JSON-RPC error field on the offending item. Those items are billed at a flat 5 credits each.

Not supported
These categories of methods are intentionally rejected:

WebSocket-only (eth_subscribe, eth_unsubscribe): the proxy is HTTP-only. Poll instead.
Stateful filters (eth_newFilter, eth_getFilterChanges, etc.): filter state is pinned to a single backend and breaks on a load-balanced proxy. Use eth_getLogs instead.
Key-holding methods (eth_sign, eth_accounts, eth_mining): hosted providers don’t hold user keys. Sign client-side and submit via eth_sendRawTransaction.
Unmapped methods: anything not allowlisted returns 400. Contact support to request additions.

To see all resources such as full method list, pricing, response headers and everything else to do with this guide, see here: https://docs.venice.ai/guides/integrations/crypto-rpc-agents#resources

u/JaeSwift — 9 days ago

▲ 12 r/VeniceAI

Grok Imagine High Quality is now live on Venice

https://reddit.com/link/1t6l9ih/video/qlbcxgberrzg1/player

xAI's most capable image model. Strongest photorealism, world knowledge, and prompt adherence in the Grok Imagine family. Up to 2K resolution.

Available to all users.

Replaces Grok Imagine Pro, which retires May 15.

reddit.com

u/JaeSwift — 15 days ago

▲ 3 r/VeniceAI

The latest $VVV discretionary burn has been executed

https://i.redd.it/0np9503dwqzg1.gif

See the transaction: https://basescan.org/tx/0x03c470a73ff5cde2d7bc8abde04fdc7ea06b79f6f1f5ef22342e64cecb53388f

Follow the burn at https://venice.ai/token

reddit.com

u/JaeSwift — 15 days ago

▲ 9 r/VeniceAI

Venice.ai Changelog - April 21 - May 5, 2026

https://preview.redd.it/199udpdwbqzg1.png?width=1376&format=png&auto=webp&s=8dc0dd741b3e36de5c647330b6540d2f1be1c912

Grok 4.3 on Venice

xAI's most intelligent reasoning model is now generally available on Venice. 1M-token context window, function calling, structured outputs, and multimodal support.

Voice Mode

Realtime voice conversations are now live on Venice. Talk to any model with memory sync, chat persistence, waveform visualization, push-to-talk input, and language switching. Now available on web, iOS and Android.

GPT-5.5 on Venice

OpenAI's latest-generation model family is now available on Venice. GPT-5.5 delivers improved reasoning, stronger instruction-following, and better multi-turn conversation across the board. GPT-5.5 Pro adds extended reasoning depth and a larger context window for demanding workloads. Both models are available now.

Kling 4K Video

Kuaishou's Kling V3 and O3 video models now generate native 4K output on Venice. Available in text-to-video, image-to-video, and reference-to-video modes, Kling 4K delivers sharper detail, better motion coherence, and cinematic-quality output at four times the resolution of previous generations.

Programmatic Burn Increase

Venice has increased the programmatic burns for new subscriptions: $2 in VVV for Pro, $5 in VVV for Pro+, and $10 in VVV for Max. Every new subscription triggers a buy-and-burn at these updated amounts.

New Models

The following models have been added to Venice:

Text Models

Grok 4.3 — xAI's most intelligent reasoning model with 1M-token context window, function calling, structured outputs, and multimodal support. Available to all users.
GPT-5.5 — OpenAI's latest-generation text model with improved reasoning, instruction-following, and multi-turn conversation. Available to all users.
GPT-5.5 Pro — OpenAI's higher-capability variant of GPT-5.5 with extended reasoning depth and larger context window. Pro users only.
DeepSeek V4 Pro — DeepSeek's full-size V4 reasoning model with extended context and strong performance on coding, math, and multi-step tasks. Available to all users.
DeepSeek V4 Flash — Lighter, faster variant of DeepSeek V4 optimized for speed and lower latency while retaining strong general-purpose performance. Available to all users.
Qwen 3.6 27B — Text model from Alibaba Cloud with 27 billion parameters, offering a balance of capability and efficiency with 128K context window. Available to all users.
GLM 5.1 E2EE — Zhipu AI's GLM 5.1 running with end-to-end encryption in a Trusted Execution Environment. Available to Pro users at no additional credit cost.

Image & Video Models

Kling V3 4K — Kuaishou text-to-video at native 4K resolution. Available to all users.
Kling V3 4K R2V — Kuaishou reference-to-video at native 4K resolution. Available to all users.
Kling O3 4K — Kuaishou O3-series text-to-video at native 4K resolution. Available to all users.
Kling O3 4K I2V — Kuaishou O3-series image-to-video at native 4K resolution. Available to all users.
Kling O3 4K R2V — Kuaishou O3-series reference-to-video at native 4K resolution. Available to all users.
HappyHorse 1.0 — Alibaba's text-to-video generation model. Available to all users.
HappyHorse 1.0 I2V — Image-to-video generation from a source image. Available to all users.
HappyHorse 1.0 Reference — Video generation guided by a reference image for style and content. Available to all users.
HappyHorse 1.0 Edit — Video editing model for modifying and transforming existing video. Available to all users.
Wan 2.7 Pro Edit — Alibaba DashScope image editing model for prompt-driven edits to existing images. Available to all users.

App

Improvements

Model Explorer Redesign — Refreshed layout for the Model Explorer with improved navigation and filtering.
Recommended Model Sort — New "Recommended" sort option in the model selector, prioritizing recently used models.
Model Details Modal — Model details can now be opened directly via URL in a dedicated modal.
Model Explorer Switcher — New entry point in the model switcher to navigate directly to the Model Explorer.
Prompt Enhancement Context — The prompt enhancement wand now incorporates conversation context when rewriting prompts.
Video Auto-Compression — Oversized videos are automatically compressed client-side before upload.
Per-Class PPU Toggles — Pay-per-use confirmation can now be toggled independently for each model class in chat.
Batch Delete Warning — Batch chat delete confirmation now warns that chats will be removed from other devices too.
Select All in Chat Delete — Added "Select All" option to the chat sidebar delete menu.
Image Auto-Downsize on Share — Images larger than 25 MB are automatically downsized before sharing.
Adaptive Thinking Always On — Removed the adaptive thinking toggle. Adaptive thinking is now always enabled.
Burn Type Tooltips — Tooltips now vary by burn type, with "Bought" label shown for discretionary burns.
China Server Location Flag — China flag icon now displayed for CN server locations in model details.
Sidebar Cleanup — Removed Help & Feedback button from the sidebar app menu.
PPU Confirmation Popup — Confirmation popup now shown when a pay-per-use model is routed.
Tool Call Loading Indicator — A loading spinner now appears in agentic chat while waiting for the next tool to execute.
Unified Chat History — All v1 and v2 chat history now appears in a single combined list in the sidebar.
Rate Limit Banner — A banner now appears in the chat input area when you've hit a rate limit.
Time Sent in Info Panels — Text, image, and video info panels now display a "Time Sent" row.
Cost Management Charts — Charts on the cost management dashboard now include today's spending data.
Wide Screen Layout — Improved 2-column grid layout on wide screens for better use of available space.
Today's Spend Card — New summary card on the cost management dashboard showing today's total spend.
Chat Performance — Conversation window now uses lazy rendering for off-screen messages, reducing lag in long conversations.

Wallet and Payments

Insufficient Credits Banner — Low credit warnings now appear as a dismissible banner above the input field instead of blocking interaction.
x402 Wallet View — Added a dedicated wallet view and admin top-up panel on the user page for x402 balances.
Voice Conversation Billing — Audio duration is now tracked per voice conversation session for accurate credit billing.
Video Credit Retry — Video generation credits are now automatically retried when an initial charge amount fails.

Mobile App

Android Voice Mode — Voice mode is now available on Android, with a prompt to update to the latest app version.
Uncensored Model Badges — Video model selectors now display an "Uncensored" badge where applicable.
Wallet Connect on Sign-In — Crypto wallet connection is now available on the sign-in and sign-up screens.
Pay-Per-Use in Chat — Pay-per-use purchase dialog added to the chat screen.
Pay-Per-Use Confirmation — Added a confirmation step before completing pay-per-use purchases.
iOS Native Chat Streaming — Chat responses now stream using native iOS processing.
Android Native Chat Streaming — Chat responses now stream using native Android processing.
Background Chat Sync — Chat responses that streamed while the app was backgrounded sync upon returning to the foreground.
Tablet Image Modal — Image detail modal now uses a tablet-optimized layout.
Tablet Dialogs — Dialogs now adapt to tablet screen sizes.
Tablet Settings Layout — Settings screens support split-screen and tablet-optimized layouts.
Tablet Modal Screens — Modal presentation screens now adapt to tablet screen sizes.
Dynamic Image Sizing — Images now resize dynamically based on device orientation.
Settings Navigation — Fixed navigation behavior and renamed settings screens.
Rate Limit Display — Updated rate limit information in settings.
Image & Video Info Sizing — Fixed sizing on image and video detail screens.
Privacy Warning Layout — Improved button positioning on the privacy warning dialog.
Conversation Replay Fix — Fixed a bug where already-read responses would replay when re-entering a conversation.
Android Chat Reliability — Fixed chat dropping or failing during request timeouts and mid-stream disconnects on Android.
iOS Background Image Generation — Fixed image generation failing when the app is in the background on iOS.
Android Background Image Generation — Image generation now continues running when the app is in the background on Android.
Text File Chat Sharing — Restored the ability to share chat conversations as text files.
Image Loading Indicator — Progress border on the image loader now waits briefly before appearing to avoid flicker on fast loads.
Image Error Display — Image generation errors now appear inline within chat messages.
Pro Upgrade Prompt — Restored the Pro upgrade button in the app header.
Default Playback Speed — Changed the default text-to-speech playback speed to 1.2x.
Auto Mode Image Editing — Auto mode now supports editing images referenced in the chat conversation.

API

Venice Skills GitHub Repository — Official veniceai/skills repository now live on GitHub with example skills covering the full Venice API surface.
Voice Cloning API — New POST /v1/audio/voices endpoint for MiniMax-based voice cloning.
OpenAI-Compatible File Inputs — Chat completions endpoint now accepts file inputs using the OpenAI-compatible format.
Model Overloaded Status Code — Model overloaded errors now return HTTP 429 instead of 503.
maxtokens Strict Cap on Reasoning Models — On reasoning-capable models, maxtokens is now a strict cap on total completion tokens (visible output + reasoning), restoring Venice's prior behavior across the model fleet. maxcompletiontokens is accepted as an equivalent alias and takes precedence if both are sent.
API File Inputs GA — File input support in the API is now generally available, no longer in preview.
Context Length in /v1/models — New context_length field added to each model object in /v1/models responses.
Free User Rate Limit CTA — Free users now see a call-to-action prompt when they hit rate limits.
Voice Rate Limit Headers — Voice agent responses now report the current rate limit and reset time to connected clients.
Qwen Image Deprecation — The qwen-image model has been deprecated and removed from both the app and the API.
Image Edit Resolution Parameter — New resolution parameter available on the image edit and multi-edit API endpoints.
Voice Mode Quota — The API now returns the caller's remaining voice mode quota in responses.
Disabled API Tier — Added a "Disabled" API consumption tier that blocks API access for the account.
Chatterbox HD on /models — Chatterbox HD voice cloning model is now listed and documented on the /models endpoint.
Per-Model Daily Costs — The Activity API now returns daily cost breakdowns per model.
Hermes Agent Integration — Official Venice integration guide for Hermes Agent, the open-source self-hosted AI agent by Nous Research. Point Hermes at the Venice API for access to 230+ models across text, image, video, audio, and embeddings with persistent memory and autonomous skill creation.

Token

Programmatic Burn Increase — Venice increased the programmatic VVV burn for new subscriptions: $2 for Pro, $5 for Pro+, and $10 for Max. Every new subscription now triggers a larger automatic token burn.
Emissions Reduction — Venice completed the first of three planned emissions reductions for VVV, reducing the rate of new token issuance from 6M/yr to 5M/yr. Additional reductions planned in June and July.

Model Deprecations

Kimi K2 Thinking — Retired. Traffic routed to Kimi K2.5 via alias. Existing API requests using kimi-k2-thinking now resolve to kimi-k2-5
Qwen3 Coder 480B — Deprecated April 30, fully retired May 4. Traffic routed to Qwen3 Coder 480B Turbo. The non-turbo variant is no longer visible in API or app
Venice Uncensored 1.1 — Retired. All traffic routed to Venice Uncensored 1.2. API requests using venice-uncensored transparently resolve to 1.2
HiDream — Deprecation date extended to May 7, 2026 (from May 1). Email sent to affected API users
NEAR AI GLM 5.0 (E2EE) — Retired. All traffic routed to GLM 5.1 (E2EE)

Fixes and Improvements

Improved inpainting progress animation to reflect actual model processing time
Fixed app menu being clipped in landscape mode on iPad Safari
Updated execution time display to show milliseconds
Fixed gallery header action buttons being clipped on narrow viewports
Fixed thinking indicator disappearing during reasoning-only streaming
Removed incomplete trailing bucket from Per Period volume chart
Updated PPU model acknowledgment to trigger once per account instead of per conversation
Fixed model search returning unrelated results via subsequence matches on description and use case
Updated PPU acknowledgment to trigger once per conversation for every PPU modality
Condensed the x402 wallet balance table from 6 columns to 3
Removed the automatic greeting sent when opening a voice websocket connection
Fixed inpaint auto mode behavior after a recent regression
Improved Hunyuan 3D results to render GLB and OBJ mesh outputs directly in the viewer
Fixed rate limiting not being correctly applied to background removal and upscale for free-tier users
Improved error alert positioning and added a retry button for failed messages
Fixed incorrect provider names displayed in the model explorer
Fixed incorrect label displayed for vision models
Improved agentic mode loading indicator with an animated gradient border
Fixed audio crackling caused by inconsistent sample rate
Fixed Max button rounding instead of preserving full numerical precision
Fixed auto-enhance preference not being respected during image generation
Updated copy on the Pro upgrade call-to-action
Improved Model Selector layout by pinning the View All Models button to the bottom of the dropdown
Fixed aspect ratio selector appearing during single-image edits with Grok
Fixed moderate post modal closing when the context menu is dismissed
Improved reordered items in the user dropdown menu
Fixed arrow key navigation in image zoom following incorrect left/right order
Fixed credit balance not updating immediately after completing a chat request
Improved rendering performance for long conversations
Fixed Spotlight Search not respecting the top safe-area inset on PWA
Restored Lustify v7 model availability after prior deprecation
Fixed missing API keys silently returning empty results instead of an error
Fixed an error occurring when quoting video content in conversations
Improved image search results with lightbox preview, context menu support, and better error handling
Fixed chat message queue issues that could cause messages to be processed incorrectly
Improved context window handling with more accurate token counting, cost display tooltips, and smarter message compaction
Fixed interactions not responding correctly in the Model Explorer
Fixed temperature warning displaying at an incorrect baseline threshold
Fixed inability to send messages containing only an attachment without text
Fixed errors when using Grok 4.1 Fast with characters

reddit.com

u/JaeSwift — 15 days ago

▲ 8 r/VeniceAI

Venice.ai partners with StrikeRobot.ai

Venice has partnered with StrikeRobot.ai to become the primary inference API backend for their robotics products. This is Venice's first major step into robotics.

https://i.redd.it/hvpxwz2trmzg1.gif

Venice as the VLM reasoning engine inside SR Agentic:

Handles vision-language understanding, reasoning across complex environments, and natural-language reporting
Uses Venice's OpenAI-compatible API
Fast edge loop stays on-device (sub-200ms)
Paid clients get private inference and Base-settled audit logs

Venice as the inference API powering SR Platform:

Drives both Text-to-CAD and Image-to-CAD environment generation
Describe an environment or drop in a reference image - Venice handles the inference that feeds the 3D pipeline into Isaac Lab and MuJoCo

SR Agentic is B2B, but SR Platform v2 is being built for public use on Base. They're integrating Microsoft's TRELLIS.2 (4B-parameter image-to-3D model) directly into the asset pipeline.

Holders of $SR and $VVV get:

Free access to co-training with the team
Eligibility for an $SR reward pool for quality contributions
A way to help finetune the framework with real-world deviation/anomaly data

SR Platform waitlist is open at strikerobot.ai/sr-platform

Sources:

Venice announcement: x.com/AskVenice/status/2052210342172697013
StrikeRobot full thread: x.com/StrikeRobot_ai/status/2052206799512879378

reddit.com

u/JaeSwift — 15 days ago

▲ 7 r/VeniceAI

Using the agentic chat on Venice, we can do everything on the platform without the need for any external LLM or agent subscriptions.

https://reddit.com/link/1t5djy2/video/gtag8zzosizg1/player

How?

https://preview.redd.it/ii55fbpssizg1.png?width=4096&format=png&auto=webp&s=c310da570e4b40183d612d9a8caf980655e40347

1. Go to Venice to use the Agentic Chat function.

https://preview.redd.it/1rahcyrysizg1.jpg?width=612&format=pjpg&auto=webp&s=b3b50032b95c56bbb22a3d202d5d25fcd6ba8c40

2. Drop in this file containing the prompt template and the image you want to animate.
This is the image I used for this particular video & effect.

3. Tell the agent exactly what you want to see:
"Following the instructions on this document, write me a 15-second prompt that describes this model smoking her cigarette, she exhales the smoke and lies back on the hood of the car. Her body melts into the paint of the car, as she transforms into a beautiful mural."

https://preview.redd.it/l975odk6tizg1.png?width=1634&format=png&auto=webp&s=629703ded515997b929f876fb3ef4f13111d6528

4. Then we copy our prompt. We take our image into Venice Studio, where we select the Seedance 2.0 R2V model (Reference 2 Video).

Select your time, aspect ratio, and resolution.

^(by) ^(@jboogx_creative) ^(on X)

reddit.com

u/JaeSwift — 16 days ago

▲ 7 r/VeniceAI

VVV - The Privacy Coin for AI
Venice is the leading platform for private and uncensored AI and VVV is the foundational asset of Venice, built to power a programmable AI economy. You can stake VVV to earn yield, and you can lock your staked VVV (sVVV) to mint DIEM, then stake DIEM for $1 per day of API credit per token.

DIEM makes AI compute ownable, tradeable, and transferable, so capacity can move between agents, bots, and applications or be monetized without selling your VVV.

VVV is the foundation of the Venice market, and DIEM is its unit of compute.

Buy > Stake > Mint

VVV is the capital asset of Venice
VVV is a crypto token on Ethereum's Base blockchain and among the top 1% of popular tokens on Coinbase.

By buying and staking VVV you can earn yield, get access to Venice Pro, mint DIEM, and be part of the platform pushing the frontier of unrestricted intelligence.

https://preview.redd.it/szxwmesxu6zg1.png?width=2400&format=png&auto=webp&s=ffc63afa0fcac5ba0b7ee1052021fba8181c0bf6

Stake VVV to unlock Venice Pro
When you stake 100 VVV you'll enjoy free access to Venice Pro, the world's leading private and uncensored AI app.

Pro users get unlimited text prompts, leading generative image and video models, and advanced features.

STAKE VVV

https://preview.redd.it/qkvdbgx2v6zg1.png?width=1024&format=png&auto=webp&s=d63044b5d8f6f529443f8d654f91aee9af94dea7

Mint DIEM with VVV
DIEM provides perpetual, ongoing access to the world's top AI models.

1 DIEM = $1 of AI credit every day.

All DIEM is created from VVV. By locking your staked VVV, you can mint DIEM and use it, or sell it to other AI consumers.

READ MORE ABOUT DIEM

https://i.redd.it/mkzc6r6qv6zg1.gif

Monthly VVV Burn

Starting Nov 2025, Venice uses a portion of monthly revenue to buy and burn the VVV token on an ongoing basis.

You can track the monthly burns directly through your token dashboard.

SEE VENICE BURNS
____________________

VVV Tokenomics

https://preview.redd.it/mrqwyx07w6zg1.jpg?width=1360&format=pjpg&auto=webp&s=ced3e35459657b8c498104d83d8a38c4745761f7

VVV launched on January 27th 2025 with a starting supply of 100M tokens.
Up-to-date supply numbers here.

VVV is engineered as a long-term deflationary capital asset of the Venice AI platform. As Venice scales, VVV becomes more scarce.

We are continually reducing emissions and in December 2025, Venice started buying VVV from the market with a portion of revenue and burning it every month - permanently removing tokens from circulation.

This creates a powerful feedback loop:
More Revenue → More Buy & Burns → Less Supply → Deflationary VVV

Buy on:

____________________

FAQ:

^(Where can I learn more about the Venice token (VVV)?)
^(Learn more about VVV through our official) ^(token launch announcement blog post)^(.)
^(Learn more about DIEM through our DIEM) ^(technical breakdown blog post)^(.)
^(You can also join our community on Discord for updates:) ^(https://discord.gg/askvenice)
^(or visit the Token section on our) ^(FAQ page) ^(for additional information.)

^(What is the Contract Address for the Venice token (VVV)?)
^(The Venice token contract address is) ^(0xacfE6019Ed1A7Dc6f7B508C02d1b04ec88cC21bf)
^(This is the smart contract address for VVV on Base.)
^(You can view the contract, balances, and transaction history on) ^(BaseScan)^(.)

^(How does VVV staking yield work?)
^(After the) ^(DIEM) ^(upgrade the Utilization Rate split is removed. VVV stakers receive 100% of emissions as yield paid in VVV. If your sVVV is locked to back minted) ^(DIEM) ^(you earn 80% of the standard staking yield while locked and 20% goes to Venice.)

^(When are staking rewards paid out?)
^(Staking rewards accumulate continually and you can withdraw them whenever you wish.)

reddit.com

u/JaeSwift — 18 days ago

▲ 8 r/VeniceAI

Venice users love the Grok model suite by xAI and its now the fastest growing on Venice in terms of usage.

https://reddit.com/link/1t3um6q/video/o5ud3wkjo6zg1/player

All fully private. Zero data retention.

┏ ⑅ ━━━━━━━━━━━━━ ⑅ ┓

Grok 4.3
Grok 4.20
Grok 4.20 Multi-Agent
Grok 4.1 Fast
Grok Imagine
Grok Imagine Pro
Grok Imagine Video
Grok Imagine Edit
Plus xAI TTS and STT for voice

┗ ⑅ ━━━━━━━━━━━━━ ⑅ ┛

Try the full suite now on Venice.ai

reddit.com

u/JaeSwift — 18 days ago

▲ 6 r/VeniceAI

Claude Code is Anthropic’s CLI tool for agentic coding.

This guide shows you how to run it through Venice AI for pay-per-token access to Claude Opus 4.6/4.7 and Claude Sonnet.

https://preview.redd.it/vbp34qb5n6zg1.png?width=1280&format=png&auto=webp&s=62a13685671c34a33f3c489c7f44fcd7252a81fc

Why You Need a Router
Claude Code connects directly to Anthropic’s API by default. To use it with Venice, you need claude-code-router, an open-source local proxy that:

Intercepts
Catches Claude Code’s outgoing requests before they reach Anthropic

Transforms
Converts request format and maps model IDs (e.g., claude-opus-4-5)

Redirects Forwards requests to Venice at api.venice.ai/api/v1/chat/completions

Setup

1. Install Claude Code

If you haven’t already, install Anthropic’s Claude Code CLI:

npm install -g /claude-code

2. Install the Router

npm install -g u/musistudio/claude-code-router

3. Get Your API Key

Generate a key from venice.ai/settings/api. You’ll paste it directly in the config file in the next step.

4. Create Configuration

Create the config directory:

mkdir -p ~/.claude-code-router

Then create ~/.claude-code-router/config.json with your preferred editor:

# Using nano
nano ~/.claude-code-router/config.json

# Or using VS Code
code ~/.claude-code-router/config.json

Paste the following configuration:

{
  "APIKEY": "",
  "LOG": true,
  "LOG_LEVEL": "info",
  "API_TIMEOUT_MS": 600000,
  "HOST": "127.0.0.1",
  "Providers": [
    {
      "name": "venice",
      "api_base_url": "https://api.venice.ai/api/v1/chat/completions",
      "api_key": "your-venice-api-key-here",
      "models": [
        "claude-opus-4-5",
        "claude-sonnet-4-5",
        "claude-opus-4-6",
        "claude-opus-4-6-fast",
        "claude-opus-4-6",
        "claude-opus-4-7",
        "claude-sonnet-4-6"
      ],
      "transformer": {
        "use": ["anthropic"]
      }
    }
  ],
  "Router": {
    "default": "venice,claude-opus-4-7",
    "think": "venice,claude-opus-4-7",
    "background": "venice,claude-opus-4-7",
    "longContext": "venice,claude-opus-4-7",
    "longContextThreshold": 100000
  }
}

^(🛈) ^(If you modify config.json while the router is running, restart it with ccr restart to apply changes.)

5. Launch

Start the router, then Claude Code:

ccr start
ccr code

Or use the activation method:

eval "$(ccr activate)" &amp;&amp; claude

Supported Models

Model	Venice ID	Best For
Claude Opus 4.5	`claude-opus-4-5`	Complex reasoning, large refactors
Claude Sonnet 4.5	`claude-sonnet-4-5`	Fast iteration, everyday coding
Claude Opus 4.6	`claude-opus-4-6`	Complex reasoning, large refactors
Claude Opus 4.6 Fast	`claude-opus-4-6-fast`	Complex reasoning with lower latency
Claude Sonnet 4.6	`claude-sonnet-4-6`
Claude Opus 4.7	`claude-opus-4-7`	Complex reasoning, large refactors

^(🛈 Claude Code is optimized for Claude models. While other models available through Venice) ^((GPT, DeepSeek, Grok, etc.)****) ^(may work, we cannot guarantee an equivalent experience since Claude Code relies on Claude-specific features like extended thinking. For other models, consider using Venice’s) ^(standard API)^(.)

· · • • • ✤ • • • · ·

Router Features
The router provides several useful features beyond basic routing:

Switch models on the fly
- Use the /model command inside Claude Code to switch models without restarting: /model venice,claude-sonnet-4-6
- ^(Useful when you want Opus for complex tasks and Sonnet for quick iterations.)
Visual configuration with UI mode
- Prefer a GUI? Launch the web-based config editor: ccr ui
- ^(This opens a browser interface for editing your config.json without touching the file directly.)
Router scenarios explained
- The Router config section controls which model handles different task types:

Scenario	When it’s used
`default`	General requests
`think`	Reasoning-heavy tasks (Plan Mode)
`background`	Background operations
`longContext`	When context exceeds `longContextThreshold` tokens

You can route different scenarios to different models. For example, use Sonnet for background tasks to save costs.

Debugging with logs
- If something isn’t working, check the logs:
  - Server logs (HTTP, API calls): ~/.claude-code-router/logs/ccr-*.log
  - Application logs (routing decisions): ~/.claude-code-router/claude-code-router.log

Set "LOG_LEVEL": "debug" in your config for more verbose output.

Caching Behaviour

Scenario	Cache TTL	Who Controls
Default (recommended)	5 minutes	Claude Code + Venice
With `cleancache` transformer	1 hour	Venice only

When NOT to use cleancache (most users)

The default configuration lets both systems cooperate:

Claude Code sends its native cache_control markers
Venice adds caching around them with a 5-minute TTL
Both systems share the 4-block cache limit

This works well for active coding sessions where you’re making frequent requests.

When to use cleancache

Add cleancache to the transformer if you:

Are hitting the 4-block cache limit errors
Experience strange caching behavior
Prefer Venice’s 1-hour TTL for longer sessions"transformer": { "use": ["anthropic", "cleancache"] }

This strips Claude Code’s cache markers, giving Venice full control with a longer TTL.

· · • • • ✤ • • • · ·

Resources

Venice API documentation
- Full API reference
Claude-Code-Router
- Source code & issues

reddit.com

u/JaeSwift — 18 days ago

▲ 5 r/VeniceAI

https://reddit.com/link/1t0yfb1/video/akk78ubazjyg1/player

This is the first of 3 monthly reductions:

May 1: 6M → 5M
June 1: 5M → 4M
July 1: 4M → 3M

Our goal is a net deflationary VVV with native yield, where burns exceed emissions

reddit.com

u/JaeSwift — 21 days ago