Sorry noob question - where can I read more about this "agents" paradigm? Is one agent's output directly calling/invoking another agent? Or there's already fixed graph of information flow with each agent (I presume some prompt presets/templates like "you are an expert this only respond in that") sorts of?
Also, how much success people have or had with automating the E2E tests for their various apps by stringing such agents together themselves
In practice this means function calling - the LLM chooses the function to call (and its parameters). Usually in a loop with a 'finish' function that returns the control to the outside code.
You can do that without function calling - as did the original ReAct paper - but then you have to write your own grammar for the communication with the LLM, a parser for it, and also you need to teach the LLM to use that grammar. This is very time consuming.
> Also, how much success people have or had with automating the E2E tests for their various apps by stringing such agents themselves together?
There’s a few startups in the space doing this like QA Tech in Stockholm, and others even in YC (but I forgot the name). I’m skeptical of how successful they’ll be, not just from complex test cases but things like data management and mistakingly affecting other tests. Interesting to follow just in case though, E2E is a pain!
Fundamentally, "Agent" refers to anything that operates in an "observe-act" loop. So in the context of LLMs, an agent sees an observation (like the code base and test output) and produces an action (like a patch), and repeats.
Also, how much success people have or had with automating the E2E tests for their various apps by stringing such agents together themselves
EDIT: Typos