r/ETL

▲ 8 r/ETL

What are the best data integration tools in 2026?

Hey everyone,

I'm evaluating data integration tools heading into Q3 2026 and would love to hear what's actually working for people right now. The landscape has shifted a lot in the last year or two (more reverse ETL, more zero-copy/data sharing, AI-assisted pipelines, etc.) and I want to cut through the marketing.

A few things I'd love your input on:

- What tool(s) are you using and roughly what's your stack/scale?

- What do you love about it?

- What are the gotchas or things you wish you'd known before adopting it?

- Anything you've migrated away from and why?

Open to hearing about Scaylor, Fivetran, Airbyte, Estuary, Hevo, Matillion, dbt + custom, Meltano, or anything else I'm not thinking of.

Thanks in advance!

reddit.com
u/AceClutchness — 1 day ago
▲ 3 r/ETL

How do ETL teams handle schema changes without breaking downstream pipelines?

Im curious about the practical strategies used in production ETL systems when source tables or API structures change unexpectedly.

reddit.com
u/Effective_Ocelot_445 — 3 days ago
▲ 10 r/ETL

Looking for on premise ETL tool. Sources .CSV files and Salesforce.

HI,

I am looking for an on premise ETL tool primarily to handle Transforming and loading data. And possibly something that can be automated/schedule to execute Stored Procedures and queries.

We don't need cloud storage or reporting, that is done through Microsoft Fabric and PowerBI.

(current fabric licenses are allocated through our parent company, and I can not use them - Some weird "separation of entity" legal red tape as they are based outside of the US.)

Data Sources: .CSV files and SalesForce.

Destination: SQL server and if possible, a push back to Salesforce.

We have a very small budget of 10K annually. Total of 2 users.

Any recommendations would be helpful. (SSIS isn't possible, since we use an Azure SQL and thus can't bill it under the parent companies Microsoft licenses).

reddit.com
u/PandaRiot_90 — 7 days ago
▲ 11 r/ETL+1 crossposts

Been building CRMs, automations, and dashboards on Base44 lately

Been deep into Base44 lately and honestly enjoying the whole vibe coding/building space.

I’ve been working on platforms involving job boards, CRM-style workflows, automations, dashboards, onboarding systems, AI-assisted forms, candidate management, subscriptions, and internal admin tools — mostly focused on making things practical and usable instead of overcomplicated.

Still learning new things every day, but if anyone here is building something on Base44 (or thinking about it) and wants another brain to bounce ideas with, troubleshoot workflows, improve UI/UX, or help structure systems, feel free to reach out.

Happy to connect with fellow builders and those curious to develop their own getting little help in the space 👌

reddit.com
u/Upstairs_Stop_3821 — 8 days ago
▲ 2 r/ETL

How do ETL teams handle data validation efficiently in large scale pipelines?

I’m curious about the practical approaches used in production ETL systems to detect bad or inconsistent data before it impacts downstream analytics.

reddit.com
u/Effective_Ocelot_445 — 8 days ago
▲ 6 r/ETL+2 crossposts

BiqQuery - larger dataset issue

Has anyone had an issue when trying fetch 20k+ records from BiqQuery to Postgres DB? Everything works fine if I keep it under 10k, using Table Input + SQL, but as soon as I try more records the pipeline fails. Odd Java error message. Ultimately, I am looking to move like 500k records from BQ to Postgres DB.

reddit.com
u/zadrogasauce — 12 days ago
▲ 10 r/ETL+2 crossposts

We built an open-source IaC tool for Snowflake, here's how it works

Most Snowflake setups end up as a mix of tools, scripts, and manual clicks. We built Snowcap to handle it all in one place: warehouses, roles, grants, masking policies, dynamic tables, etc.

No state file. It queries Snowflake directly on every run and generates the SQL to match your config. If someone makes a change outside the tool, it catches it next run.

We wrote up the full overview here: https://datacoves.com/post/snowcap-snowflake-infrastructure-as-code

Happy to answer questions if anyone's dealing with Snowflake RBAC or provisioning headaches.

u/Data-Queen-Mayra — 10 days ago
▲ 0 r/ETL

A tool to catch schema drift and API changes before they break your ETL pipelines. Looking for feedback!

Most pipelines break because an upstream source changed without warning. I built a platform to catch these issues before they crash your ETL.

What it does:

  • Schema Monitoring: Detects renamed columns, dropped fields, or type changes in real-time.
  • Uptime Checks: Verifies your APIs and Databases are online before the pipeline runs.
  • Instant Alerts: Notifies you the moment drift is detected or any problem with the source.
  • Simple Setup: Connect your SQL DBs or REST APIs in under 2 minutes.

Would you use it and what features would make this a "must-have" for your workflow? Thanks!

reddit.com
u/West-Candidate-2708 — 12 days ago