Wang, X., Li, B., Song, Y., Xu, F. F., Tang, X., Zhuge, M., Pan, J., Song, Y., Li, B., Singh, J., Tran, H. H., Li, F., Ma, R., Zheng, M., Qian, B., Shao, Y., Muennighoff, N., Zhang, Y., Hui, B., … Neubig, G. (2024). OpenHands: An Open Platform for AI Software Developers as Generalist Agents. arXiv:2407.16741. https://arxiv.org/abs/2407.16741
@unpublished{openhands,
title = {{OpenHands: An Open Platform for AI Software Developers as Generalist Agents}},
author = {Wang, Xingyao and Li, Boxuan and Song, Yufan and Xu, Frank F. and Tang, Xiangru and Zhuge, Mingchen and Pan, Jiayi and Song, Yueqi and Li, Bowen and Singh, Jaskirat and Tran, Hoang H. and Li, Fuqiang and Ma, Ren and Zheng, Mingzhang and Qian, Bill and Shao, Yanjun and Muennighoff, Niklas and Zhang, Yizhe and Hui, Binyuan and Lin, Junyang and Brennan, Robert and Peng, Hao and Ji, Heng and Neubig, Graham},
year = {2024},
eprint = {2407.16741},
archiveprefix = {arXiv},
primaryclass = {cs.SE},
url = {https://arxiv.org/abs/2407.16741},
note = {arXiv:2407.16741},
arxiv = {2407.16741}
}
Software is one of the most powerful tools that we humans have at our disposal; it
allows a skilled programmer to interact with the world in complex and profound ways. At the
same time, thanks to improvements in large language models (LLMs), there has also been a rapid
development in AI agents that interact with and affect change in their surrounding
environments. In this paper, we introduce OpenHands (f.k.a. OpenDevin), a platform for the
development of powerful and flexible AI agents that interact with the world in similar ways to
those of a human developer: by writing code, interacting with a command line, and browsing the
web. We describe how the platform allows for the implementation of new agents, safe
interaction with sandboxed environments for code execution, coordination between multiple
agents, and incorporation of evaluation benchmarks. Based on our currently incorporated
benchmarks, we perform an evaluation of agents over 15 challenging tasks, including software
engineering (e.g., SWE-BENCH) and web browsing (e.g., WEBARENA), among others. Released under
the permissive MIT license, OpenHands is a community project spanning academia and industry
with more than 2.1K contributions from over 188 contributors.