Auto-Enhance: Developing a meta-benchmark to measure LLM agents’ ability to improve other agents — AI Alignment Forum