Discussion about this post

User's avatar
A student's avatar

Excellent article that I wish everyone in the corporate ranks would read. It's tiring listening to everyone assume that agents will perfectly swap out human tasks. The same degradation of process happens when human efforts are strung together, so why is everyone not recognizing that this is the case with agents as well? It's biased thinking that AI is going to just never experience this I see from anyone who hasn't actually tried this on a real process and not just a toy example published to a reel somewhere.

What I've experienced as I've done these automations is that I wind up being the Oracle. So I showcase my new automated process but hide the fact that I spent a large amount of time guiding the process to the result. I wish more people would be honest about this.

I actually shifted to the self critique method early on and always used a different LLM to critique the result of another LLM. This produced much better results, however, it takes a really long time and jacked up my token usage. For example in image generation I got from maybe 40% acceptable to almost 90% acceptable but this took on average 10 extra loops through the image gen because of the extra LLM critiques. So I basically did 10 gens to get from 40-90 but it still was not good enough. still required the oracle to guide it from 90 to 99!

Florent Michel's avatar

Very interesting article, which highlights a crucial point too often neglected in discussions of AI trajectory. I wonder if one should not even go one step further: in many areas, a failure probability of 20% is still way too large to be useful. (Although this does not invalidate your arguments!)

2 more comments...

No posts

Ready for more?