r/agenticaidev

I built a Codex session review app using Codex. How are you tracking your AI coding workflows?
▲ 9 r/agenticaidev+5 crossposts

I built a Codex session review app using Codex. How are you tracking your AI coding workflows?

I built a small free macOS tool for reviewing Codex sessions using the Codex desktop app. Are people here using anything similar to improve their AI coding workflows?

After longer Codex runs, I kept finding that the transcript was technically available, but hard to review.

The things I wanted to inspect were:

- What changed

- Which files were touched

- Where tokens went

- Which tool calls mattered

- Whether the prompt/context was good enough to reuse

- What context would be useful to share during code review

So I made BuildrAI, a local-first app that turns Codex session artifacts into timelines, token usage, prompt/session evaluation, changed-file context, and shareable reports.

I’m curious how other people are handling this.

Do you review Codex sessions after the fact, or do you mostly trust the final diff?

u/michaliskarag — 1 day ago
▲ 19 r/agenticaidev+3 crossposts

KiroEnsemble: an enterprise-grade multi-agent framework that turns Kiro CLI into a full dev team 🚀

Kiro CLI is powerful for single tasks. KiroEnsemble turns it into an entire development team.

If you use Kiro CLI, you know it's great for single tasks at your desk. But I wanted it to carry a whole feature the way a real developer does:

  • plan it
  • build it
  • test it
  • review it
  • document it
  • and open the PR
  • without me driving every keystroke.

So I turned it into a team. A lead agent orchestrates and never writes code itself: a builder writes the code, a validator runs the tests and checks the spec, a reviewer diffs the branch, and a documenter writes the docs

You hand it a ticket or a spec, and it delivers a tested, reviewed, documented, PR-ready change. No babysitting.

This isn't a toy. It's enterprise-grade, used by engineers within a renowned international company.

📊 It runs as a full-stack developer on live enterprise codebase:

in one month, straight from personal session logs (using /record-session custom skill):

- 20+ tickets shipped across 4 repos

- 89% completed clean (18 success, 2 partial, 0 failed)

- Features and bug fixes delivered, with tests, documentation and code review on every run

It picks up a ticket, builds it, tests it, reviews its own diff, documents it, and hands back a PR. That's the job of a full-stack dev, and it does it on real, conventions-heavy enterprise repos.

🏗 It also builds projects from scratch:

Point it at an empty directory and it ships. My portfolio site (https://mmo.sidihub.cloud/) and its AI assistant were built end to end by this team, from spec to deployment.

  • ⚙️ Why it holds up where other agent setups fall apart
  • 🧠 Real separation of roles, not one model pretending to be five.
  • 📋 Spec-driven: requirements, design, and tasks, so it builds what you asked for.
  • 🪶 Lean orchestrator: the lead never loads your code, so context stays sharp across long runs.
  • 🔁 Self-correcting: bounded retries with a diagnostician pass before giving up.
  • 🎫 Process-aware: reads tickets, follows your branch and commit conventions, opens the MR, posts the summary.
  • ✍️ Grounded knowledge and context through typed files & can be integrated with obsidian
  • 🔒 Safe by default: a guard blocks destructive commands and asks before anything risky.

🛠 How it works

Clone the repo, copy the .kiro setup into your desired repo, and either manually edit the conventions and names to fit your needs & project, or ask kiro to adapt it, 5~ minutes setup.

  1. Prepare a Spec or a Plan document for what you want to work on.
  2. Start a kiro-cli session -> type `/agent swap team-lead`
  3. prompt the team-lead to pickup the spec and start the agentic workflow

NOTE: Agents have pre-chosen models that I personally use, edit the agent configs to change the models to your credit budget.

GitHub Repo: https://github.com/Moifek/kiro-ensemble

Free and open-source (MIT). Clone it, run it on your own project, and tell me where it breaks.

Roadmap: Better logging & Mobile integration

What's the most complex thing you'd trust an agent team to ship for you?

Hey u/few_Map7816 I'll be trying to make this work with your solution :D let's connect !

u/DaraosCake — 3 days ago

Auditing agentic AI in a regulated environment, how are you handling compliance?

We're a healthcare company with strict HIPAA requirements. We've been running traditional ML models in production for years, deployed via Seldon Core on-prem. The audit trail is solid: we know who deployed what model, what data was used, and what predictions were made.

Now we're exploring agentic AI for clinical decision support. The problem is auditability. An agent doesn't just make a single prediction. It reasons, calls multiple tools, iterates, and produces a final output. Compliance wants to know every step: which agent made which decision, what tools were called, what LLM was used, and what data was accessed.

Most agentic platforms are SaaS-first and don't support on-prem deployment. Even the ones that do have weak auditing compared to what we're used to with Seldon. We need a solution that can run in our air-gapped environment, meets HIPAA requirements, and gives us a complete audit trail across the entire agentic workflow. Is anyone else navigating this? What's working?

reddit.com
u/Terrible-Market1264 — 6 days ago
▲ 12 r/agenticaidev+5 crossposts

chatgpt say 5000$

I have target to complete this coure in next 2 month?? at what level can i reached?? at last there are some project also?? can i make money through it??

u/Free-Candidate-9198 — 13 days ago
▲ 1 r/agenticaidev+1 crossposts

need help on tool running before passing to llm

hi,
i'm shan from India. I'm learning Langchain. Since I'm a total noob in python and langchain i'm not able to test my tools in jupyter notebook one by one.

Actually, i'm trying a simple get_weather agent. but i couldn't run the tool before running the llm for testing. Kindly, help me.

Here is my coe:

get_weather.invoke({"location":"beijing"})

get_weather.invoke({"location":"beijing"})

get_weather.invoke({"location":"beijing"})

get_weather.invoke({"location":"beijing"})

get_weather.invoke({"location":"beijing"})

get_weather.invoke({"location":"beijing"})

get_weather.invoke({"location":"beijing"})

get_weather.invoke({"location":"beijing"})

import os
import json
from dotenv import load_dotenv, find_dotenv
from langchain_deepseek import ChatDeepSeek


from urllib.parse import quote
from httpx import request

def get_weather(location:str)->str:
    """
    this is used to get the weather of a city anywhere in the world. 


    Args:
        location (str): specify the location to get its weather. eg) 'New York', 'Beijing', 'New Delhi' e.tc 


    Returns:
        return a string that explains that city's weather in celsius, fareheight and how the weather is.
    """
    key=os.environ.get("WEATHER_API_KEY")
    url="https://api.weatherapi.com/v1/current.json?q={location}&key={key}"
    encoded_url=quote(url)
    response=request.get(encoded_url).json()
    if response:
        return f"weather data for **{location}**: celsius: {response["current"].temp_c} farenheight: {response["current"].temp_f}, day is: {response["current"]["condition"].text}"
    #    data=response.json()
    #    return data
    else:
        return f"Weather API failed retry after some time"



  get_weather.invoke("beijing")

for this i get the error:

---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
Cell In[28], line 1
----> 1 get_weather.invoke("beijing")

File ~/Documents/Agentic-experiments/tools-experiments/.venv/lib/python3.12/site-packages/langchain_core/tools/base.py:738, in BaseTool.invoke(self, input, config, **kwargs)
    730 
    731 def invoke(
    732     self,
   (...)    735     **kwargs: Any,
    736 ) -> Any:
    737     tool_input, kwargs = _prep_run_args(input, config, **kwargs)
--> 738     return self.run(tool_input, **kwargs)

File ~/Documents/Agentic-experiments/tools-experiments/.venv/lib/python3.12/site-packages/langchain_core/tools/base.py:1100, in BaseTool.run(self, tool_input, verbose, start_color, color, callbacks, tags, metadata, run_name, run_id, config, tool_call_id, **kwargs)
   1098 if error_to_raise:
   1099     run_manager.on_tool_error(error_to_raise, tool_call_id=tool_call_id)
-> 1100     raise error_to_raise
   1101 output = _format_output(content, artifact, tool_call_id, self.name, status)
   1102 run_manager.on_tool_end(output, color=color, name=self.name, **kwargs)

File ~/Documents/Agentic-experiments/tools-experiments/.venv/lib/python3.12/site-packages/langchain_core/tools/base.py:1066, in BaseTool.run(self, tool_input, verbose, start_color, color, callbacks, tags, metadata, run_name, run_id, config, tool_call_id, **kwargs)
   1064     if config_param := _get_runnable_config_param(self._run):
   1065         tool_kwargs |= {config_param: config}
-> 1066     response = context.run(self._run, *tool_args, **tool_kwargs)
   1067 if self.response_format == "content_and_artifact":
   1068     msg = (
   1069         "Since response_format='content_and_artifact' "
   1070         "a two-tuple of the message content and raw tool output is "
   1071         f"expected. Instead, generated response is of type: "
   1072         f"{type(response)}."
   1073     )

File ~/Documents/Agentic-experiments/tools-experiments/.venv/lib/python3.12/site-packages/langchain_core/tools/structured.py:97, in StructuredTool._run(self, config, run_manager, *args, **kwargs)
     95     if config_param := _get_runnable_config_param(self.func):
     96         kwargs[config_param] = config
---> 97     return self.func(*args, **kwargs)
     98 msg = "StructuredTool does not support sync invocation."
     99 raise NotImplementedError(msg)

Cell In[25], line 41, in get_weather(location)
     37     """
     38     key=os.environ.get("WEATHER_API_KEY")
     39     url="https://api.weatherapi.com/v1/current.json?q={location}&key={key}"
     40     encoded_url=quote(url)
---> 41     response=request.get(encoded_url)
     42     if response:
     43         # return f"weather data for **{location}**: celsius: {response["current"].temp_c} farenheight: {response["current"].temp_f}, day is: {response["current"]["condition"].text}"
     44        data=response.json()

AttributeError: 'function' object has no attribute 'get'

how to resolve this and get results in testing what the tool prints.

reddit.com
u/WinnerPristine6119 — 10 days ago