r/DuckDB

▲ 1 r/DuckDB

Duckdb-WASM precompiled 1.5 version

Hi all,

currently my application uses Duckdb-wasm precompiled using version 1.32.0. Does anyone know when they will create a precomiled version for 1.5.0. i am pulling via NPM.

reddit.com
u/pchappo — 4 days ago
▲ 17 r/DuckDB

What would you expect from a good visual Parquet workflow for DuckDB users?

Hi DuckDB community,

Disclosure: I’m from Paradigma Software, the team behind Valentina Studio. 

Some users asked us whether Valentina Studio could support working with local Parquet files.

In response, we added Parquet support in Valentina Studio 17.4, and I’m curious whether it covers the workflows DuckDB users actually need.

In the Free edition user can open Parquet file, read it, work with it in SQL Editor, having AI Chat assistance.

In the Pro edition more tools integrated to work/edit/manage parquet files in the same way as a local DB: for example DuckDB files or SQLite files.

  • Schema Editor — inspect Parquet schema and column types
  • Data Editor — browse file content visually, including nested/list/struct values
  • Data transfer - move records from/to Parquet to/from any other supported DB.
  • Report Editor - Parquet can be used as data source.

For those of you who work with Parquet regularly — do you have any wish-list for features or workflow improvements?

P.S. VStudio available on Mac, Windows 32/64/arm, Lin x64/arm

u/ruslan_zasukhin — 5 days ago
▲ 28 r/DuckDB

Tired of grep | awk | sort | uniq -c for log debugging — built a CLI that pipes them into a browser dashboard

Standard story: prod alert fires, I SSH in, pull down a few hundred MB of JSONL logs, then spend 20 minutes building the same one-liner I always build:

grep ERROR app.log | grep -v noise | awk '{print $5}' | sort | uniq -c | sort -rn

Works, but it's terminal-only and I can't slice by a second dimension without rewriting it. Wanted something closer to a Splunk/Datadog view but for ad-hoc local files.

So I built a CLI that pushes local logs into a browser-side DuckDB instance:

npx duckviz ./logs/ -r

It opens a tab with the files already loaded — timeline, group-by, filter, chart. Logs never leave your machine; the CLI runs a loopback HTTP server with a bearer token and the browser pulls files locally (DNS-rebinding guard, manifest-gated, all that).

Supports JSON, JSONL, CSV, XML, and unstructured logs (LLM detects the format on first load).

Disclosure: Genuinely curious what shape of logs breaks it — drop a sample if you have a weird one.

u/viksdev — 8 days ago
▲ 15 r/DuckDB

DuckDB 1.5 introduced the new VARIANT type, so we added support for it in Valentina Studio 17.3.

https://preview.redd.it/okxv3i49p90h1.png?width=1536&format=png&auto=webp&s=24587f8e3f6db2eab1bebd7c7efc3642e6eff8da

Current support includes:

  • Schema Editor integration
  • Visual inspection/editing of nested objects & arrays
  • special editors for images, blobs, UUIDs, etc.
  • AI-assisted SQL Editor
  • Direct editing of VARIANT values in Data Editor

Available on macOS, Windows and Linux.

Curious how DuckDB users currently work with semi-structured data and whether visual tooling around VARIANT is useful in practice.

More details:
https://valentina-db.com/dokuwiki/doku.php?id=valentina:articles:vstudio_v173_duck_variant

reddit.com
u/ruslan_zasukhin — 12 days ago
▲ 13 r/DuckDB+2 crossposts

Text-to-SQL with DuckDB

I’m currently creating a modular text-to-SQL library, for this I need to run integration tests regularly and evals preferably on databases with a variety of tables. DuckDB has been perfect for this as I can run it locally without the cost of storing and processing my sample database in a cloud data warehouse. The TPC-H extension also meant I could populate my DuckDB database really easily too. I’ve outlined where this is up with a DuckDB example in this video. Sharing as I think DuckDB has been so useful for me in this scenario. Text-to-SQL: using piglets to prepare your context with u/duckdb and u/OpenAI

https://youtu.be/cNXm1t_4mh0

u/mportdata — 13 days ago
▲ 8 r/DuckDB

Using DuckLake with Azurite (DuckDB 1.4.4 vs 1.5.2) — experience & issues

Return of experience using ducklake with Azurite.
DuckDB version tried 1.4.4(LTS) and then 1.5.2 both using duckdb ui and python library.
Setup windows laptop using azurite vscode extension stock .
Uploaded a 20M lines table (+/-350mb parquet) on Azurite blob storage.

Some early difficulty connecting to the storage.
1^(st) try: fairly simple to create a table from the blob storage and manipulate it.
Experience is quick and easy (python and duckdb -ui 1.4.4).
2^(nd) try: Ok let’s try the encrypted version (still using python and duckdb -ui 1.4.4).

Doesn’t work at all seem like it tries to make too many (first 2x64MB blocks write successfully, then crashes with Error: 409 The committed block count cannot exceed the maximum limit of 50,000 blocks). Tested on another laptop same issue.
3^(rd) try: Before complaining I figured I should try the latest version of duckdb.
So tried the same thing with 1.5.2 (python and ui).

Unencrypted works great.

Encrypted doesn’t
It doesn’t crash but is slow enough that I don’t think it’s usable (2 hours to “upload” on local azurite).

Since there are a lot of things I might have missed , a lot of friction points and unknown unknown I’m checking with the community to see if anyone got an idea of what I could improve.

Anyone got the same experience or poc that work well with similar setup ? Or any comment ?

reddit.com
u/throwawayforwork_86 — 14 days ago