We’ve been working on a retrieval system for teams building AI agents in finance.
(mainly around workflows that need to do in-depth web research).
A few patterns we keep running into:
- cost per query gets high quickly with deep research flows
- latency makes it hard to use in real workflows ( not the quick superficial simple search)
- bloated context windows
Anyone here who is running ai agents in production or uses deep research APIs regularly:
- what is your experience with using those for automations of the financial research tasks?
Would really appreciate any examples of a better approach or any other challenges you see that we are still going to get into.