AI benchmarking startup Arcada Labs is testing five leading AI models as autonomous agents on X, evaluating their real-world social media capabilities.
Microsoft Research introduces CORPGEN, a framework enabling autonomous AI agents to manage complex, multi-horizon tasks in corporate environments through hierarchical planning and memory.
Nous Research has released Hermes Agent, an open-source AI system designed to overcome the forgetfulness of traditional LLMs by implementing multi-level memory and support for remote terminal access.